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PREFACE 


This book is intended for the student of physics or chemistry who 
wishes to use quantum mechanics. Not much space, therefore, is de¬ 
voted to the foundations of the subject; in particular no attempt is 
made to present the foundations in their most general form, as is done 
in the classical treatises of Dirac and of Pauli, for example. Schrodingers 
wave equation for a single particle in an electrostatic field is shown to 
follow from diffraction experiments ; generalizations of the equation 
for the magnetic field, the many-body problem, for the spin, and for 
relativistic velocities are introduced as and when the physical sub¬ 
jects under discussion make it necessary. This treatment, though it 
lacks the elegance of the more general formulation, has the advantage 
of keeping close to experimental fact. It should appeal to the research 
worker who wants to learn the subject in order to explain new facts of 
physical science, rather than to the man whose interest is in the logical 
basis of quantum mechanics, and in the improvements necessary in 
those parts that remain unsatisfactory. 

We are grateful to. Dr. A. F. Devonshire and Dr. K. Cuggenheimer 
who have read parts of the book in manuscript or in proof, and made 
suggestions for its improvement. 

N. F. M. 
I. N. S. 

April 1948 > ^ 
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INTRODUCTION 


Quantum or Wave Mechanics is the name given to a system of equations 
which must be used instead of Newton’s Laws of Motion in order to 
calculate the behaviour of atoms and electrons and other ultimate 
particles of matter. Newton’s laws have proved adequate for the 
mathematical description of a very wide class of phenomena, including, 
for instance, the motion of the planets round the sun and the motion 
of projectiles under the influence of gravity and air resistance. When, 
however, they are applied to the behaviour of electrons, and in particular 
to the motion of electrons within the atom, the answer that they give 
is not in agreement with experiment. Consequently it has been neces¬ 
sary to invent a new system of equations which gives as completely as 
possible an answer which does fit the facts. These equations, while 
profoundly different from Newton’s laws, give very nearly the same 
results except when applied to particles of extremely small mass such 
as electrons; this will be shown in Chapter I of this book. Therefore it 
must not be thought that tw o different sets of equations exist, one for 
atoms and electrons and one for the heavier bodies of everyday ex¬ 
perience ; Newton’s laws can be considered a special case of the more 
general quantalf laws. 

The first step in the development of the new r system of mechanics is 
due to the German physicist Max Planck, who in 1900 made the assump¬ 
tion that the energy of light was quantized, in order to explain the 
spectrum of black-body radiation.£ He introduced into physics a new 
constant of nature, Planck’s constant. In 1905 Albert Einstein made 
use of Planck’s assumption to account for the photoelectric effect. In 
1913 Niels Bohr, the great Danish leader of theoretical physics who was 
working at that time wdth Rutherford in Manchester, made use of the 
idea of quantization and of Planck’s constant to give a quantitative 
theory§ of the line spectrum of hydrogen. In this theory, the outlines 
of which are given in Chapter II of this book, Newton’s Laws of Motion 
were accepted as the correct equations for the electrons in atoms. In 
the hydrogen atom the single electron was therefore assumed to revolve 
round the positively charged nucleus in circular or elliptic orbits. The 
new assumption was that only certain of these orbits were allowed, 

■f The word ‘quanta!* has been introduced as an adjective to describe that which 
pertains to the quantum or to quantum mechanics. 

t M. Plank, Ver. d. Deut. Phys. Oesellschafi, 2, 237 (1900). 

§ N. Bohr, Phil. Mag. 26, 1 476, 857 (1913). 
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those for which the angular momentum was an integral multiple of 
hj2rr, h denoting Planck’s constant. 

In the hands of Niels Bohr in Copenhagen, Arnold Sommerfeld in 
Munich, and a number of other investigators, a fairly detailed theory 
of atomic and molecular spectra was developed along these lines ;f this 
was known as the Quantum Theory. The theory was brilliantly success¬ 
ful in many directions and its formulation was the work of a man of 
genius; nevertheless it became apparent in a very few years that it, too, 
gave the wrong answer in many cases, and between 1924 and 1927 the 
completely new system of equations which we call Quantum Mechanics 
was put forward. In this system Newtonian mechanics is abandoned 
completely and the quantization of Bohr’s theory appears as a con¬ 
sequence of the equations. These developments were carried out under 
the guidance and inspiration of Niels Bohr, by a large number of 
investigators—in particular, de Broglie in France,J Schrodinger in 
Switzerland, Heisenberg, Pauli, and Born in Germany,§ and by Dirac 
in England.|| The theory was originally presented in two different 
forms, the ‘Matrix Mechanics’ and the ‘Wave Mechanics’. Both 
formulations are mathematically equivalent and lead to the same 
results. In most of this book the methods of Wave Mechanics will be 
used, because they are most suitable for the applications of the theory 
to the simpler problems of physics and chemistry. 

We shall not attempt in this book to follow closely the historical 
development of the subject. Before introducing the equations of 
quantum mechanics, we shall give an outline of the experimental facts 
which have made the development of these equations necessary. We 
shall then show how these facts force us to invent new laws of motion 
quite different from Newton’s laws. These laws will then be developed 
and applied to a number of physical phenomena. 

t See, for example, N. Bohr, 4 On the quantum theory of line spectra ’, D. Kgl. Danske 
Vid.-Selsk. Shifter, Nat. Math, iv, 1 (1918); Vber die Quantentheorie der Linienspektren 
(Braunschweig, 1923). 

X L. de Broglie, Ann. de Physique, 3, 22 (1925). 

§ W. Heisenberg, Zeits. f. Physik, 33, 879 (1925); M. Bom and P. Jordan, ibid. 34, 
858 (1925); M. Bom, P. Jordan, and W. Heisenberg, ibid. 35, 557 (1926); E. Schrddinger, 
Ann. der Physik, 79, 361, 489, 734 (1925). 

|| P. A. M. Dirac, Proc. Roy. Soc. A, 109, 642 (1925); 112, 661 (1926); 113, 621 (1927); 
114, 243, 710 (1927). 
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THE SCHRODINGER EQUATION 

1. The wave nature of matter 

1 . 1 . Interaction of matter and radiation 

Onk of the most obvious facts about light is that opaque bodies cast 
shadows, and that in casting shadows light appears to travel in straight 
lines. It was this fact which led Sir Isaac Newton to put forward his 
corpuscular theory of light, according to which energy is conveyed away 
from a source of light in the form of the kinetic energy of a stream of 
particles. The reason why the corpuscular 

theory was abandoned, and replaced by E C 

the wave theory, will be familiar to every 

reader of this book, namely that light A 

shows interference. The interference is n, 

shown in its simplest form in Fig. 1. A n. 

point source of light 8 is separated from ^ ~p 

a screen CD by an opaque screen KF 

with pin-holes at A and />, 8A being p ^ 

equal to SB. The pin-holes will both act fic». 1. Tntorlbronco oxj>orim<mt. 
as sources of light. If the nature of light 

were corpuscular, the illumination on the screen CJJ from the pin-holes 
A and B could be added together; thus the illumination at all points 
on the screen is greater when both holes are open than when one is. 
Since, however, light is propagated as a wave motion, this is not the 
case; the two sources reinforce each other at points P such that 
PA — PB is zero or an integral multiple of a wave-length, but interfere 
destructively if PA—PB differs from this by a half wave-length, so 
that the two sources of light spreading from A and B are out of 
phase. 

All direct ways of measuring the wave-length of light, such as the 
use of the diffraction grating, are based on the interference principle. 
This is true also of methods for measuring the wave-length of X-rays 
by crystal spectroscopy. 

Maxwell’s theory of the electromagnetic field has shown us what it 
is that oscillates in a light wave. According to this theory, it is the 
electric vector E } defined as the force that would act on a unit positive 
electric charge at any point, and the magnetic vector 11 defined in an 

5072 B 



2 THE SCHRODINGER EQUATION § 1 

analogous way. A plane polarized wave, moving in the 2 -direction and 
with its electric vector in the ^-direction, will have the form 

E x = A sin 27r(vt—z/\), E y = E z = 0, 

H y = A sin 2n{yt — 2 /A), H x = H g = 0. 

Here v is the frequency and A the wave-length; if the wave is in a vacuum 
the wave velocity is c (= 3x 10 10 cm./sec.), so that 

vX = c. 

We now know that electromagnetic waves can exist with a great variety 
of wave-lengths between several kilometres and that of the hardest 
y-radiation, less than 10~ 12 cm. Down to values of the order 10 -12 cm., 
measurements have been made by methods of crystal spectroscopy, 
though for shorter wave-lengths there is no direct evidence for the 
wave-nature of the radiation. 

For radiation of the wave-lengths used in radio transmission, that is 
to say, greater than a few centimetres, analysis based on Maxwell’s 
theory appears completely satisfactory, both as regards the transmis¬ 
sion of the waves and their reaction on the receiving aerials, f In the 
receiving aerials currents are induced; these can be calculated in two 
ways which are mathematically equivalent; either by considering the 
electric vector E as being responsible for setting up an electromotive 
force along the aerial, or the magnetic flux threading the circuit as 
inducing a current in it. 

When we turn to radiation of shorter wave-lengths, that is to say, 
to light and to X-rays, the position of the wave theory in general and 
Maxwell’s theory in particular is less satisfactory. It appears to be 
correct when applied to problems of propagation such as reflection, 
diffraction, and refraction of radiation; in fact, as already emphasized, 
the facts of diffraction and interference compel the adoption of a wave 
theory. But it does not correctly describe what happens when light 
is observed, or more precisely when light gives up its energy to matter. 
When this happens, phenomena always occur which suggest that light 
consists of a stream of particles rather than a continuous train of waves. 
An appreciation of this dual nature of light is essential for an under¬ 
standing of quantum mechanics, and we must consider it in some detail. 

Leaving out of consideration the human eye, the most important 

j* Quantum effects with centimetre waves have, however, been observed, especially 
connected with nuclear spins. Cf. the report by J. B. M. Kellogg and S. Millman, Rev. 
Mod . Rhys . 18, 323 (1940). 
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ways in which light and radiations of shorter wave-length are observed 
and measured are: 

1. Photography; 

2. Photo-conductivity, or the property that light of suitable wave¬ 
length has of making cortain solids and liquids conduct; 

3. The photo-electric effect, or the property that light and especially 
ultra-violet light has of releasing electrons from the surfaces of 
solids or from gas molecules. 

Modern research (cf. Chap. VIII) has shown that the primary action 
of light on photographic emulsions is to make the halide grains photo¬ 
conductors; in discussing the quantum nature of light, therefore, it 
need not be considered separately. The photo-electric effect shows 
most clearly the properties of light which we wish to emphasize, and 
this will bo discussed first. 

What happens when light falls on the surface of a metal or of some 
other conducting material is briefly as follows: if the frequency of the 
light is too small, there is no emission of electrons. As the frequency 
is increased, emission begins at a definite frequency r u . If the frequency 
v is greater than v 0 , electrons will be emitted with kinetic energies loss 
than or equal to a maximum value W max given by 

H^nax = h{v— •'o)- 

h is here Planck’s constant. This equation and the experimental facts 
on which it is based are perhaps the logical basis for the definition of 
this constant. 

These facts show that, when light gives up its energy to matter by 
ejecting photo-electrons, it behaves as though it were a stream of 
particles each of energy hv. The energy hv 0 will represent the minimum 
energy necessary to eject an electron from the surface. All this was 
pointed out by Einstein in 1905; the hypothesis of the quantization 
of light had already been introduced by Planck to explain the facts 
about black-body radiation. 

The phenomenon of photo-conductivity points in the same direction. 
All photo-conducting substances show a lower limit v 0 above which 
the frequency of the radiation must he if the effect is to occur. On the 
basis of Einstein’s hypothesis hv 0 will be the minimum energy that an 
electron must absorb in order to escape from the molecules or atoms 
of the solid into a free state in which it can move through it. The 
nature of this free state need not concern us here and will be discussed 
in a later chapter. The important thing to notice is that a lower limit 
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of frequency exists below which the effect does not occur. The existence 
of a lower limit is common to all photo-electric and photo-chemical 
effects and in fact to every mechanism whereby light gives up energy 
to matter except that of inducing currents in conductors. Here there 
is no lower limit, as we know from the behaviour of radio waves which 
induce currents in aerials, the energy of which can be dissipated as heat 
or absorbed in a receiver. This apparent escape from the quantized 
nature of light is connected with the existence of free electrons in 
metals. 

We must examine further the nature of these light quanta, and since 
they move with the velocity r, equal to ,‘lx 10 10 cm./sec., we must make 
use of the formulae of the Principle of Relativity in discussing their 
properties. According to this principle, the energy of a particle of mass 
m moving with velocity v is 



For small values of v/c this reduces to wc 2 -\-bnv 2 ; the second term is 
the familiar kinetic energy, the first represents the energy that would 
be liberated if the particle were annihilated. Now, from this formula 
it is apparent at once that since a light quantum moves with velocity c 
and has finite energy, its mass must be zero. A light quantum there¬ 
fore must be thought of simply as a little bundle of energy, and when 
it loses its energy it has no mass or other measurable attribute, and can 
be considered to cease to exist. 

The question arises whether a light quantum can give up part of its 
energy to matter, or whether it always gives up all or none as in the 
photo-electric effect. The effects associated with the names of Ram an | 
and of Compton $ show that it can. We cannot discuss these fully until 
we have treated the quantization of atomic energy levels; briefly we 
may say that under certain conditions light after being scattered by 
gas molecules may have a component with lower frequency than the 
original. In terms of the quantum hypothesis, part of the energy of 
each quantum is absorbed by each molecule. 

Another formula from the Principle of Relativity is useful, namely 
that connecting kinetic energy (IF) with momentum (p). This is 

W = Cyj(m 2 c 2 +p z ). 

For light quanta, for which m vanishes, this gives 

W = cp . 

f C. V. Raman and R. S. Kriahnan, Nature , 121, 501 (1928). 

% A. H. Compton, Phys, Bev. 21, 483 (1923); I 5 . Debye, Phys. Zeits . 24, 161 (1923). 
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Therefore the momentum of a light quantum is hv/c . This formula 
gives an easy way of calculating the pressure of light, since it shows 
that if light falls on a totally absorbing surface the momentum trans¬ 
mitted is equal to the energy absorbed divided by c . Thus 

c X pressure — energy absorbed per unit area per unit time. 

Also the above formula for the momentum of a light quantum has been 
applied to explain tho Compton effect. 

A beam of light can thus be described in two ways: (1) as a beam of 
particles, each of energy W and momentum p\ (2) as a wave of frequency 
v and wave-length A. Instead of the wave-length it is often convenient 
to introduce the wave-number K, equal to the reciprocal of A, 

K =- 1/A. 


Like the momentum p, K is a vector; for the wave amplitude can be 


written 


sin 2n(K x x+K u y+K z 3-vt), 


so the quantity K x x -f- K y y -}- K z z is scalar; that is to say, it is inde¬ 
pendent of the coordinate axes chosen. Thus since (x,y,z) is a vector, 
so is (A^, K y , K s ). 

Now since the same beam of light can be described alternatively by 
two vectors, p or K, the momentum or the wave-number, it must 
necessarily be assumed that 


p — const. K. 


(1) 


From the formulae alroady given (p — hv/c, K = 1/A -- vjc) it follows 
that the constant is h. 


We may go farther; in the principle of relativity the momentum p 
and the energy divided by c form together what is known as a 4-vector; 
thus (p x ,Py,p~, Wjc) are components of a vector in the space-time 
continuum. So are (K x ,K lP K s ,v/c), the four numbers specifying the 
state of a wave. If the same physical event is to have a particle and 
a wave description it follows that each component of the one vector 
must be proportional to the corresponding component of the other. 


Thus 


W - hv, p - hK, 


where h is some constant. These are equations which we have already 
established on experimental and theoretical grounds, with h equal to 
Planck’s constant. 


This curious dual property of light must now be defined more clearly; 
if we wish to calculate the path of a light beam, through any series of 
slits or gratings, we must use the wave theory. The wave theory gives 
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us at all points of space a value for the electric and magnetic vectors 
E and H and predicts that the energy density there will be 

E*+H* 


8n 


( 2 ) 


But when we come to investigate how light reacts with matter we find 
that it usually gives up its energy in quanta hv, and always acts as 
though it were a stream of particles. Formula ( 2 ), in fact, gives only 
the energy density averaged over a long time in a steady beam. In 
other words, the average number of light quanta per unit volume in 
the beam is ^ 


hv 87 r 

In general, whether we are dealing with a steady beam or not, we can 
say that the 'probability that a light quantum is at a given instant in 
the volume element dxdydz is 

1 E 2 +IP 

hv 87 r 


dxdydz. 


This dual nature of fight was quite unexpected, and was forced on 
theoretical physics by overwhelming experimental evidence. It has 
never been ‘explained’. O 11 the other hand, it has been found that this 
dual nature is not confined to fight, but is common to electrons, atoms, 
and in fact all material particles. It has then to be accepted as a 
fundamental law of nature, on which any scheme of mechanics must 
be built up. 

1 . 2 . Wave properties of material particles 

The evidence that cathode-rays consist of particles, the electrons, 
need not be reviewed here. Our whole system of thought about atoms 
is based on it, as is witnessed by the confidence with which such a state¬ 
ment is made as that the Uranium atom contains 92 extra-nuclear 
electrons. All the arguments of this book w ill be based on the assump¬ 
tion that the electron is a particle of mass m and charge e and that it can 
only be created and destroyed under rather special conditions peculiar 
to high-voltage laboratories, stellar interiors, and ^-active nuclei. 

The discovery that beams of electrons, and indeed beams of heavier 
particles, ions or molecules or neutrons, have wave properties was made 
between 1927 and 1928, by Davisson and Germerf and by G. P. Thom¬ 
son:]: for electrons and later by Stern and his collaborators§ for particles 

t C. Davisson and L. H. Germer, Phys. Rev. 30, 707 (1927). 

t G. P. Thomson, Proc. Roy. 80 c. A, 117, 600 (1928). 

§ F. Knauer and O. Stem, Zeits.f. Physik , 53, 786 (1929), I. Estermann and O. Stern, 
ibid. 61, 115 (1930). 
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of atomic mass. The wave effects—at any rate the most striking ones, 
were shown when electron beams were reflected or scattered by crystal¬ 
line substances, diffraction phenomena extremely similar to those with 
X-rays being obtained. These phenomena enabled the wave-length A 
of the waves to be determined in terms of the energy W or momentum 
p of the particles; the latter can of course be determined for charged 
particles from the accelerating voltage or the deflexion in a magnetic 
field. The experiments showed that the relation was as follows: 

A -= hip. (3) 

This is often written A = hjmVj 

but (3) is correct even for particles moving with relativistic velocities.f 
It will be noticed that (3) is the same relation as that which holds 
between the momentum of a light quantum and the wave-length of 
the corresponding light wave. 

The magnitude of the wave-length is of the order 10~ 8 cm. or less for 
fast electrons, and thus of the order of magnitude of tho diameter of 
any atom. 

In a famous and interesting paper published in 1924 before the 
experimental discovery of the dual nature of electrons, de BroglieJ 
argued that if such dual properties existed, (3) must bo true. His 
argument was the one already put forward in § 1.1; if K, v are the wave 
number and frequency of the wave, p, W the momentum and energy 
of the particle, both form 4-vectors in the space-time continuum. 
Therefore, since all components of (K,y) must be known if (p, If) is 
known and vice versa, we must have 

p = AK, W - hv, 

where h is some constant. The facts already known about light quanta 
suggested that the constant would be Planck's constant. 

We shall return in § 4 to a consideration of the equation W — hv. 

We see, then, that just as for light quanta, when we wish to calculate 
the path of a beam of electrons (at any rate through crystals), we have 
to treat the beam as a wave. The intensity of the wave will be pro¬ 
portional to the density of electrons in the beam; more exactly it gives 
the probability of finding an electron at some point. However, unlike 
the position with regard to light, where Maxwell’s theory of E and II 

f For the experimental evidence for this, of. J. V. Hughes, Phil. Mag. 19, 129 (193.5); 
see also S. C. Curran, ibid. 24, 953 (1937). 

J L. de Broglie, Dissertation (Masson, Paris, 1924); see also Phil. Mag. 47, 446 (1924); 
Ann. de Physique , 3, 22 (1925). 
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already existed, nothing was known about the electron waves. A theory 
had to be built up from the beginning to fit the facts. 

In the literature the wave amplitude is denoted by 0 and is called 
the wave function. As wo have seen, the only way these waves can 
make their presence felt is by the relation between the wave intensity 
and the density of electrons. The simplest assumption that we can 
make is that the density of electrons is proportional to the wave 
intensity. Since we know nothing a 'priori about these waves, it is 
simplest to choose the units in which if/ is measured so that the density 
is equal to the wave intensity. Thus 

\if/\ 2 = number of electrons per unit volume, 
or |0| 2 dxdydz = the probability that an electron is in the 
volume element dxdydz . 


The wave function ib is a complex quantity, and may therefore be 

writtou <A = f+*y, (4) 

where / and (j are real functions of position and time. The intensity 
of the wave, and hence the average number of particles per unit volume 
at any point, is taken to be equal to the square of the modulus of the 
wave function, which is written 0*0 or |0| 2 and defined by 


0 * 0 - 101 2 -/ 2 +? 2 . 

It is often a stumbling-block to the beginner in this subject that a 
physical quantity, the wave function, should be represented by a 
complex function. The reason is as follows. We know a priori nothing 
about the wave function, but we should expect, by analogy with the 
case of light waves, that the type of expression which in other wave 
systems represents the energy density would in this case give the particle 
density. But the energy density in any wave system is always given 
by the sum of the square of kvo independent functions whose magni¬ 
tudes define the state of the wave. For a light wave they are E and II 
and the energy density is f/ 2 )/87r. For elastic waves they are 

the displacement and velocity of the medium. Thus for the waves 
associated with an electron or other material particles, it is reasonable 
to assume that the state of the wave is defined by two quantities / and g 
at any point, and it is convenient to combine them into a single func¬ 
tion 0 by means of equation (4). 

It may be noted that Maxwell’s equations for the electromagnetic 
field in free space may be treated in the same way; the equations are 


—c curl E = 


m 

dt 9 


_dE 
~~ dt 9 


c curl II 
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and if \js is written for both equations become 


ccurh/j 


i 

dt 


It will be convenient at this stage to make a further assumption 
about the form of a plane wave. A plane wave travelling, say, along 
the #-axis has, for any ty pe of wave motion, the form 

A sin 27 t( Kx— vt -f-c), 

where K is the wave number and v the frequency. Tn a plane polarized 
light wave E and H are in phase; thus the energy density is proportional 

sin 2 ‘l^Kx-vtA- e) 

and fluctuates with time at any point. There is no reason to think that 
any such fluctuation occurs in the wave associated with an electron; 
it would in fact be difficult to understand what physical significance 
could be ascribed to a rapid fluctuation of the probability that an 
electron would be found at a certain point. It is therefore reasonable 
to suppose that / and g are 90° out of phase, so that 

f A cos 2ir(Kx—vt), 
g ~ J.sin277(7v#— vt), 
and P+g 2 - A \ 

which keeps a steady value independent of time. Making use of the 
complex function tfj we see that a plane wave is represented by 

l/j JLf>2iri(Kx-vi)' 

We shall represent a wave going in the other direction by 

if) :=_= A £(-A\r - 

In the remainder of this book we shall follow the accepted convention 
and use always the complex wave function 0, and shall not refer again 
to the real and imaginary parts, / and g. 


1 . 3 . The connexion with Newtonian mechanics 

We have seen that when we wish to calculate the path of a beam 
of electrons or of any other material particles, we must postulate the 
existence of a train of waves if* of wave-length hfp; the intensity |(/r| 2 
gives us the average number of electrons per unit volume in any part 
of the wave. The necessity for this procedure is shown by the behaviour 
of electrons in passing through crystals, where scattering centres (the 
atoms) have diameters comparable with the wave-length. In the 
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absence of crystal diffraction, however, Newton’s Laws of Motion have 
been entirely successful in accounting for the behaviour of beams of 
charged particles, in particular their deflexion by electric and magnetic 
fields. It would be very unsatisfactory if we had to assume that 
Newton’s laws were valid in the latter case and wave mechanics in the 
former. Actually, wave mechanics is perfectly general and includes 

Newtonian mechanics as a special case. In 
this section we shall show, for the case of 
deflexion of a beam of electrons by an electric 
field, that wave mechanics gives the same 
result as classical Newtonian mechanics so 
long as the field remains sensibly constant 
over the whole width of the beam. The case 
of magnetic fields is treated in § 8. 

The electron beam is supposed to enter the 
field at 0 in Fig. 2, where the kinetic energy 
is W. The field is supposed to be conserva¬ 
tive, so that at any point of space a potential 
energy function V(x,y,z) can bo defined, 
equal to the work done to take the electron 
from 0 to the point (x,y,z) considered. The kinetic energy there will be 

W—V(x,y,z). 

From the wave point of view the wave-length of the de Broglie waves 
must be hjp. Since at any point ( x t y>z ) 

p 2 2 m(W—V)> 

it follows that the wave-length A at that point is given by 

h/<J{2m(W-V)}. (5) 

The wave-length therefore varies continuously from point to point, 
and therefore the problem of determining the path of the beam is that 
of determining the path in a medium of continuously varying refractive 
index. This can be done quite generally, for any type of wave motion, 
as follows: 

We require at each point along the path of the beam the curvature 
of the beam due to the refraction. In Fig. 2 let AB, A'B' be lines 
drawn along the intersection of two successive wave fronts with the 
plane of the paper. They will meet at P, the centre of curvature of the 
beam. 
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If A is the wave-length at any point, say at BB\ then the wave¬ 
length A A' on the other side of the beam may be written 




where t is the width A B of the beam and djdn denotes differentiation 
along BA. Also denoting PB by R, PA by R~\-t } we have from the 
similar triangles PBB\ PA A' 


or 



( 6 ) 


This formula gives the radius of curvature R of the beam in terms of 
the rate of change of A, or in other words, the rate of change of the 
refractive index. 

In our case, using formula (5) for the wave-length, (6) gives 


1 _ c)V 

R ~ 2 (W-V) dn' 


(V 


But this is exactly the formula that we should obtain using Newtonian 
mechanics. This may be seen as follows: according to Newtonian 
mechanics, the curvature of the orbit of the particles in the beam can 
be obtained by equating the centrifugal force mv 2 /R to the force normal 
to the trajectory exerted by the field on the particles. This is equal to 
dF/dn. We have therefore 

mv 2 dV 

R dn * 


But since mv 2 is equal to 2 (W—V), this is identical with (7). 

So long, therefore, as the field remains sensibly constant across the 
width of the beam, a calculation of the path according to the principles 
of wave mechanics will give the same result as a calculation according 
to Newtonian mechanics. 

A beam must necessarily have a width large compared with the 
wave-length. It is not difficult to see, therefore, that the condition for 
the applicability of Newtonian mechanics is that the kinetic energy of 
the particles should change by only a small proportion in a distance 
comparable with a wave-length. If this condition is not fulfilled diffrac¬ 
tion effects characteristic of wave motion will occur. These are discussed 
in the next sections. 
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2. The wave equation of Schrodinger 

In this section we shall derive SchrOdinger’s equation for the wave 
function 0 associated with a particle or beam of particles, all with the 
same energy, moving in an electrostatic or other field of force. The 
total energy of each particle will be denoted by W , and its potential 
energy in the field of force by V ( x , y, z), as in the last section. Therefore 
we have 

kinetic energy of each particle — W — V(x,y, z). 

Now a beam of waves moving in either direction along the z-axis 
has the form ^ ^ Ac ^n ± Kz^vt)^ 

and both expressions for 0 satisfy the equation 

£f+4**XV-0. (8) 


Conversely, equation (8) means that 0 has the form of a wave of wave- 
number K moving in one direction or other along the z-axis, or else 
that 0 is given by the superposition of two such waves 

0 ^ \Ae l7TiKz -\- Ke~ 2niJCs ]e- 27rivt . 

It will be remembered (cf. p. 9) that the time factor is always of the 
form e~ 2nivf . More generally, a wave of wave-number K moving in the 
direction given by the direction cosines (l, m, n) will have the form 

0 = A (j2niK(lx i my-\ nz)~2TTivt^ /()\ 


Remembering that l 2 ~\~m 2 ~{-n 2 = 1, it will easily be verified that 
(9) satisfies the equation 

V 2 0 +-4tt 2 A 2 0 ^ o, (10) 

where V 2 denotes the operator 


C)X- dy- dz 2 


Conversely, the most general solution of (10) can be formed by the 
superposition of any number of waves of the type (9), with the same 
wave-length but different directions and amplitudes. Equation (10), 
therefore, when applied to a wave function, means simply that the 
wave-length is 1/A". 

Now we know from direct experiment that the relation between the 
kinetic energy W —V of a particle and the wave-number K is 

K 2 = 2 m(W-V)/h\ 


( 11 ) 
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If we substitute for K in equation (10) we obtain the well-known 
equation of Schrodingerf 

v ¥+-p -(^-vw = o. (12) 

If V is constant, or varying very slowly from place to place, this 
expresses nothing more than what we already know about the wave 
function. It is, however, a new assumption that ip will satisfy this 
equation even when significant changes in V occur in distances com¬ 
parable with A, as, for example, within the atom. This assumption will 
be made; it is justified first because of its simplicity, but ultimately 
because it gives results in agreement with experiment. 

In what follows wo shall write this equation in the form 

VV + 5' ( W- F)«A - o (ft = ft/ 2n). 


3. Motion of electrons in one dimension 

In this and in the next section we shall discuss some simple problems 
which illustrate the conditions under which Newtonian and wave 
mechanics give different results, and what the nature of the difference is. 

In particular, our analysis shows that when particles move in fields 
such that W—V changes appreciably in a distance comparable with 
a wave-length, the results differ from classical mechanics. To show 
this we consider the extreme case of a discontinuous 'potential jump’, 
where the kinetic energy of the particles changes discontinuously. 
This may be thought of as the limiting form of a very strong field 
acting over a very short distance. We shall show that if a beam of 
particles is incident on a potential jump, some are reflected and some 
transmitted. According to Newtonian mechanics they would be all 
transmitted or all reflected, according as to whether the height of the 
jump V 0 was greater or less than the kinetic energy W. 


3.1. The effect of 'potential jumps on the motion of an electron 
We consider first the effect of a potential jump on the motion of 
an electron, i.e. we consider the solutions of the one-dimensional 
SchrOdinger equation when the potential function is given by 



(x < 0), 
(x > 0). 


f E. Schrodinger, Ann. der Physik, 79, 361 (1926). See also E. Schrodinger, Collected 
Papers on Wave Mechanics (London, 1928). 



14 


THE SCHRODINGER EQUATION 


§3 


There are then two cases to consider according as V x ^ W, where W is 
the energy of the electron. 

Case (i). V x < W. The differential equations governing the motion 
of the particle are then 

g+“¥ = o (* < o), 

= 0 (x > 0 ), 

where a 2 = ~-( W-7 0 ), P i = ^(W-V 1 ). 


The wave function to the left of the potential step may be taken as 
0 (e i0tx +Ae~ i0iX )e- 2nivi (x < 0), 

consisting of an incident wave e iiocx ~ 27rv V of unit amplitude and a reflected 
wave of unknown amplitude A. When x > 0 we have a solution 

xfi = Be i P x - 27Tivt 9 

representing a wave transmitted through the potential jump. 

The fact that there is a finite discontinuity in the potential function 
V occurring in the Sell rod in ger equation means that if/" suffers a finite 
discontinuity at x = 0. The increase in if)' in crossing the origin is then 
€ 

lim f ifj" dx = lime{^ ,r (~€)+0"(t)} = 0, 

€~*0 " €—►O 


so that ifj } ifV are continuous at x = 0. Thus there are two equations 

1 +A = B, 
a(l— A) = fiB 

for the determination of the constants A and B. It follows immediately 
that o 

A = B 


2ol 


Now the velocity of the particles in the incident and the reflected 

beam is v== ( gOjjzjoj U = 

ml m 9 


so that in an incident beam of unit amplitude the number of electrons 
crossing unit area in unit time is aft/m = N t . The number of particles 
in the reflected beam is 


N r = —AA* = — 
m m 
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where v f = fihjm is the velocity of the transmitted beam. Similarly 
the number of electrons in the transmitted beam is 


m 


= phi 2a \ 2 
m\p+a} 


N, 


4vv' 


i (v 7 +W y 

From these expressions it follows immediately that 

N { = N r +N t1 

so that, as we would expect, the total number of particles is constant. 

This analysis shows the fundamental difference between Newtonian 
and wave mechanics. According to Newtonian mechanics, it would of 
course follow 7 that AJ = 1. 

Case (ii). V x > W. In this case the equation for the wave-function 


ifj in the region x > 0 is 


dx 2 


rV> 


where h 2 y 2 = W); for the transmitted wave we then take the 

exponentially decreasing solution 

tft = Be~v x ~ 2nivl . (13) 

The boundary conditions at x = 0 give 

1 +A = B , i<x(l-A) - y B . 

Solving for A we find A = -—, 

oc-\-ly 

so that AA* = 1, 

showing that the amplitude of the reflected wave is equal to the 
amplitude of the incident wave. Thus all the electrons are reflected. 
If the theory had not predicted this it would have needed revision. 


3.2. The tunnel effect 

We have seen in the last section that the wave function of a beam 
of electrons reflected by a potential jump does not vanish at the point 
x = 0 where, according to the Newtonian interpretation, the electrons 
should be reflected. From this point onwards, according to the formula 
(13), the wave function decays rapidly and is proportional to e~y x . 

We shall now consider the incidence of a beam of electrons upon a 
'potential barrier’ bounded by sharp potential jumps. We shall take 
for the potential energy of the particles 

r 0 (x < 0), 

V 0 (0 < x < a), 

,0 (x > a), 


P(*) = 
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and shall suppose that the kinetic energy W is less than V 0 . The 
Schrodinger equation then assumes the form 

0"-f- ol 2 i/j = 0 (x < 0, x > a) y i/j " = y 2 ijj (0 < x < a ), 

where aft = V(2mW), yft = V{2m(Tj — If)}. Then, just as before, the 
solution of the wave equation for x < 0 will be taken to consist of an 
incident and reflected wave 


$ = (e i0lx -]-Ae- i<xx )e- 27rivi , 
and the solution for x > a, a transmitted wave 

0 ii - }}(>* a£C - 2777 Vt. 

Within the potential barrier, for 0 < x < a, the general solutioji is 
0 (Cey x ~\ ])e~v r y- 2 * ivl . 

The boundary conditions are as before that i/j and i// shall be con¬ 
tinuous; these conditions at x “ 0 give 

l+A - C+I), ia(l-A) - y(C—I)), 

and at x a Cey a -\~l)?~ y(l — Be i,xa , 

y(Ce ya — J)e~ ya ) = iaBc ieut . 

Eliminating A, C, I) from these equations we obtain 

B =z 2^2 cosh -jsinhyaj. 

If we define as transmission coefficient , or transparency, T, the ratio 
of tho number of electrons crossing unit area per unit time in the 
incident and transmitted beams, we have 


T — BB* ~ AI 4 cosh 2 yft ^ ~ j sinh 2 yft . 


If ya is large both cosh ya and si nil ya are approximately equal to leV a 
and the transparency of the potential barrier becomes 




In most cases the amplitude of the wave which ‘penetrates ’ the 
‘potential barrier’ between x = 0 and x = a will be very small. Never¬ 
theless these arguments show that, according to wave mechanics, if a 
beam of electrons is incident on a potential barrier of the type described, 
a small proportion of the total current will penetrate through it. This 
prediction of wave mechanics has been termed the ‘tunnel effect’. 
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The tunnel effect has several important applications. It applies of 
course to protons and to heavier particles as well as to electrons, but 
for heavy particles the proportion that will go through potential barriers 
of atomic dimensions is negligible and the tunnel effect is unimportant. 
For protons and a-particles it is, however, of great importance in the 
theory of the nucleus and of radioactivity, f Electrons will in practice 
pass freely through potential barriers a few electron volts high and of 



Fici. ,3 a. The rectangular barrier. 



thickness up to about ten atomic diameters. This is the reason why 
an electric current will flow between two metals in contact, in spite 
of the layers of oxide or grease with which a metal surface is normally 
contaminated, and the reason why electrons can pass between metallic 
electrodes and ions in solution. 

The wave equation for a triangular barrier can also be solved 

f G. Gamow, lStructure of Atomic Nuclei , chap, v (Oxford, 19.37). 

0 


5072 
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exactly. Consider a triangular barrier defined by the potential function 
(Fig. 36) 

f 0 (x < —V 0 jk) 

y Vq-\~1c>x ( VJk <c x <C 0) 

V 0 —kx (0 < x < VJk) 

^0 (x > VJk), 


The altitude of the triangle is thus V 0 and the slope of the sides ±Ic. 
The Schrodinger equation then becomes 

f'+c Af, = 0 (\x\ > 

r-(p 2 +2«x)4> = o 


I), (14C) 

where a 2 -= 2 rnW/h 2 , fi 2 = 2m(J^— W)/h 2 , and /c = mk/h 2 . 


i/j" —(/ 3 2 — 2kx)i/j = 0 




x < oj 


(14a) 

(146) 


If we substitute £ — (2/c) i (.T-| /3 2 /2^) 


in equation (146) then it becomes 




£<A> 


the solution of which may be writtenf 


*A — CiAi(£)+c 2 Bi(£), 

where Ai(£), Bi(£) are the Airy integral and its companion function 

00 

Ai(£) = i J eos(^2+j3 s ) dz, 

0 

co 

Bi(f) s= ~ J {e^-t« , 4-sin(2^+|z 3 )}rf2. 

0 

Similarly the substitution 

= ( 2K ) i (- a; +^) 


t H. Jeffreys and B. S. Jeffreys, Methods of Mathematical Physics , 447 (Cambridge, 
1946). 
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reduces the equation (14 c) to the same form. We may then write for 
the wave function 


if, = e i<xx+A e - i0i x 

0=OAi(f)+/)Bi(f) 
iff — E F Ri^j) 

if; = 



If we apply the conditions that iff and iff' are continuous at x = 
and at x = 0, we obtain six equations for the six constants A, 72, C, 
D f Ey F. Solving these equations we obtain an expression for B in 
terms of the Airy integrals and companion functions with arguments 
(j8 3 /2/c) 8 and — (a 3 /2/c) 8 . If k is small in comparison with a 3 and jS 8 we 
can replace the Airy integrals by their asymptotic expansions*)* 


Ai (z) 


2V7 T 


z-*exp(—jjs 1 ), Bi(s) ~ -~ r 2 +i exp(| 2 8 ) 


to obtain finally T = BB* = exp(—4/? 3 /3/c) 

for the transparency of the triangular barrier. If we introduce the 
width l of the barrier at the height of penetration, so that 

l V 0 -W 

2 VJk V 9 

. 2(V 0 —W) P* 

we have l = - ---- = —, 

a; /c 

and the formula for the transparency becomes 

T - exp(-P). (15) 


3.3. The WKB method 


A solution of the Schrodinger equation 


d 2 iff ^ 2 m 

dtf'H? 


{W-V)iff^0 


appropriate to the case where W— V is everywhere positive and where 
W— V changes by a small amount in a distance comparable with the 
wave-length A{2m(W—F)} _i can be obtained using a formula due 


f H. and B. S. Jeffreys, loo. cit., p. 470. 
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originally to Jeffreys*! The Schrodinger equation may be written in 
the form \, A 

with f(x) = 2m(IF--F)/S 2 a slowly varying function of x. If we write 
the solution of this equation in the form 

ip = expia(#), 

X 

then in the case where/(a;) is a constant ol(x) is simply/hr or ff*dx. 

o 

If / is not a constant, then oc satisfies the differential equation 

i(x f —a' 2 -f-/ = 0. 

If we assume that a is also a slowly varying function of x so that 

a" < a' 2 , ( 16 ) 


then we have a = db J / J 

^0 

For a second approximation we substitute this solution in the term iot " 
in the differential equation for a and obtain 

Extracting the square root and using the fact that/' is small in com¬ 
parison with / we obtain the equation 

«' = ±P+W~* -(/*). 

which may be integrated to give 

X 

a = ± \pdx+\ilogfK 

Xq 

giving ip — Cf~*exj)(±i jf i dx\, (17) 

where C is a constant. 

This solution is of course valid only in those regions for which the 
condition (16) is satisfied. Using the first approximation for a we see 
that this condition is equivalent to 

/,(?)’ = >'*> < *• 

t H. Jeffreys, Pror. Lond. Math, ft or. (2), 23, 428-36 (1024); Phil. Mag. 33, 451-6 
(1942). 
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According to equation (17) the total number N of electrons crossing 
unit area in unit time is 

N = ifnf**v = CG*f W — F)}* = CC+, 

which is a constant, as it ought to be. 

In the preceding analysis it was assumed that the function f(x) was 
positive in the range considered. If f(x) < 0 we find that the solution 
can be written in the form 

tp = (—-/) -*exp(± | (—f)*dx j. 

To 

Jeffreys also gives formulae for connecting up solutions at a zero of 
the function f(x), which we may take to be at x — a: 0 . Suppose for 
definiteness that 

f(x) < 0, a; > .r 0 ; f(x) > 0, x < x 0 , 
then the connecting formulae can bo written as 

(/) -1 © x P[±i(£+M) <-> (-/)" J [exp M±\i oxp(—Jf)J, (18) 

X X 0 

where M = J (—f) h dx, L = J f l dx. 

fl"o X 

This method of approximating to the solutions of a second-order 
linear equation was first applied to Sehrodinger’s equation by Wentzel,f 
Kramers,J and Brillouin§ and is known usually as the WKB method. 
Higher approximations may be obtained by writing the solution of the 
wave equation in the form 

<A = ex p(| J X dx j> 

Xq 

where ^is a power series in (h/i), 



with coefficients Xm which are functions of x . By substituting in the 
differential equation and equating powers of fl it is found that 

X ' = mw-V)}>, 

t G. Wentzel, Zeits.f . Physik, 38, 518-29 (1926). 
t II. A. Kramers, ibid. 39, 828-40 (1926). 

§ L. Brillouin, Comptes rendus, 183, 24-6 (1926). 
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in agreement with the formula (17), and that 

xl+xi _ 5F' 2 


y* 


X* 


2 Xo 32(2 m)*(W-V)i %(2m)*{W—V)*’ 

and similarly for ^ 3 , x 4 ,... to any desired degree of accuracy. 

A rigorous treatment of this method of approximation has been 
given recently by Kemblef and Titchraarsh.J 

The condition (16) for the validity of this method is that 


< i. 


(19) 


A further condition is that in the neighbourhood of a zero x =-= x x of 
F— W, the function F'(#)(#—a^) shall be a good approximation to 
V—W. 

3.4. The transpare.ncy of potential barriers by WKB method 
In the theory of rc-decay it is of interest to study the motion of a 
particle in a liekl whose potential is of the form shown in Fig. 4.§ 



From the formula (17) the solution representing a wave going from left 
to right in the region x > x x is 

h = j \^{w-v)^dX 

From the connexion formula (18) it follows that the wave function 
for the region x x < x < x 2 which connects up with </r 3 at x = x 2 is 

where M 2 = J j^~(V'— Td^) dx. 

X 

f E. C. Kemble, Phys. Rev. 48* 549 (1935). 

J E. C. Titchmarah, Eigenfunction Expansions associated with Second-order Differential 
Equations , chap, viii (Oxford, 1946). 

§ G. Gamow, Zeits.f. Physik , 51, 204 (1928); E. Condon and R. W. Gurney, Phys . 
Rev . S3, 127 (1929). 
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/ or i 

Writing if 2 = M _ JJ&F- W)j* dx = A-M l 

Xi Xi ' 

we may put the expression for i/j 2 in the form 

The solution joining this at x = is 

& = J^(W-F)J" i {e- <£, (^+le-'=‘)-ie"-(e'-ie-' 4 )}, 

where 2^ — J j— (IF—F)j^ <fo. 

The wave function ip 1 represents an incident wave 

4>t = -»{|™(F)) _1 (e-‘-ie-4 )( ,a, 

and a reflected wave 0 r ; the wave function i/r 3 represents the wave 
transmitted through the potential barrier. The transparency of the 
barrier measured as the ratio of the current density on either side is 

T = vMthhM? = (e A -~le~ 4 y\ 

If e A ;> e~ A , as in the case of a high wide barrier, then 

[ X-2 

- 2 J{~(F-IF ) } i rfx . (20) 

Xi 

This differs slightly from a formula given by (3amow,f who calculates 
not the ratio of current densities but that of probability densities. 
The derivation given here is due to B. Jeffreys. J. 

For a triangular barrier of the type considered in § 3.2 we have from (20) 

T = expl -2 J j^(-p+x)J^-2 J J* dx , 

*- -u V o 

where l = 2(V 0 - W)/k = p 2 /*. 

Thus T = exp( -|(^) i (10 i } = exp(-P), 

in agreement with equation (15). 

A comprehensive account of the determination of the transparency 
of a potential barrier is given in Kemble’s book.§ The application of 

f G. Gamow, Structure of Atomic Nuclei , p. 94, equation (25) (Oxford, 1937). 
t B. Jeffreys, Proc. Camb. Phil . Soc. 38, 401 (1942). 

§ E. C. Kemble, Fundamental Principles of Quantum Mechanics (New York, 1937). 
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the results of calculations of this type to the theory of a-decay is given 
in the monograph by Gamow, but the treatment of barrier problems 
given there is unsatisfactory since the asymptotic expansions are em¬ 
ployed at points where they become infinite. 

The penetration of particles into potential barriers in several dimen¬ 
sions has been considered by Kapur and Peierls.f 


4. Frequency and group velocity of the waves associated with 
material particles 

For any kind of wave motion there exists a definite relation between 
tho frequency v and the wave-length A. For light waves in a vacuum, 
for instance, this relation is 

v = c/A. 

In general we may write v = v/X, 


whore v is the ‘wave velocity’, which may be a constant, or may he 
itself a function of the frequency, as is tho case lor light waves in a 
dispersive medium. 

The wave velocity is the velocity with which the crest of each wave 
travels. The group velocity is the velocity of a wave pulse or of a 
wave front. For instance, if a stone is dropped into water, tho front 
of the disturbance travels out with the group velocity. The time taken 
for a sound to reach our ears is equal to the distance of the source 
divided by the group velocity. All direct ways of measuring the velocity 
of light give this quantity, and it is also the speed with which radar 
pulses are propagated. 

The group and wave velocities are equal only if the wave velocity 
is independent of frequency (as for light in a vacuum). In general the 
formula for the group velocity V a isj 


V a 


dv 


dm 

In terms of our usual notation, K — 1/A 


A 2 


dv 

d\' 


V a = 


dv 

Ik’ 


( 21 ) 


Since v = vK this gives 


r ° = °+ K m- 


Formula (21) is most easily proved by considering the velocity of 


f P. L. Kapur and R. Peierls, Proc. Boy. Soc. A, 163, 606 (1937). 
J C. A. Coulson, Waves , p. 132 (Edinburgh, 1941). 
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the wave groups formed when two wave trains of nearly equal frequency 
are superimposed. For let these two wave trains have the amplitudes 

A sin 2tt(Kx— vt) 
and A sin 27 r(K'x—v't). 


On adding these we obtain 


2 A sin 2tt 


IK+K' v+v'\ 0 /A—A' 

[ 2-— ijcos27r^ - x— 



If K, A' are nearly equal this represents a train of wave groups; each 
wave moves with the wave velocity (H-v')/(A+A'), but each group 
moves with the velocity 

v—V 

K^K n 


which, if v and v are nearly equal, tends to dvjdK. 

The same formula can be shown to give the velocity of a single wave 
pulse. 

The application of the concept of group velocity to wave mechanics 
is as follows: suppose that a beam of 
particles is incident with velocity V on 
a screen, in which there is a hole which 
can be closed by a shutter. The shutter 
is closed to begin with, then opened 
for a time l 0 , and then closed again. 

Obviously a beam of length Vt Q will 
pass through the hole and travel as a 
whole with velocity V. According to 
the concepts of wave mechanics, how¬ 
ever, we must describe the whole phenomenon in terms of the wave 
function i/j. A continuous train of waves falls on the screen; when the 
shutter is opened a train of waves of limited length, that is to say a 
wave group or wave packet passes through, and as usual the intensity 
of the wave is equal to the density of electrons in the beam. This wave 
group must travel with the group velocity of the waves; thus if wave 
mechanics is to give a correct description of observed phenomena, the 
group velocity of the waves must be equal to the actual velocity of 
the particles that they represent. 

Up to the present in this book, in considering steady beams of 
particles, we have not had occasion to define the frequency of the 
waves associated with them. Now, however, we must do so. The 



Fi«. 5. 
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H 


group velocity, by equation (21), is equal to dvjdK . The velocity F of 
the particles is equal to hE/m. We have therefore 


& - f =**/-• 


( 22 ) 


We can integrate this equation and obtain 

v = lhK 2 /m-\- const. 

If we substitute for if in terms of F, we find 

v = lmV 2 /h-\- const. 

We thus see that, apart from an unknown constant independent 
of F, hv is equal to the kinetic energy of the particles with which the 
waves are associated. 

We have next to consider the value of this constant. It will be 


realized that a steady beam of particles moving for an indefinite time 
through a field of force must be represented by a wave of constant 
frequency. It is natural to take a point where the potential energy 
is zero and to define hv as the kinetic energy there. We see therefore 


that 


hv = W, 


(23) 


where W is the total energy of each particle. This is the formula which 
we shall use for the frequency. It must be realized, however, that the 
point where the potential energy vanishes is in fact arbitrary; for 
instance that of a particle acted on by gravity may be considered zero 
on the earth’s surface, or at an infinite distance. The total energy of 
a particle, therefore, docs contain an arbitrary constant; all that has 
physical meaning is the change in the energy. 

It is rather disconcerting to find that the frequency of those waves 
also contains an arbitrary constant; it suggests that, though the equa¬ 
tions of wave mechanics are correct in their description of how matter 
actually behaves, these waves have not the same kind of physical 
reality as sound or electromagnetic waves. This view will be confirmed 
by the considerations of the next section. 

The derivation of equation (23) is non-relativistie. A relativistic 

derivation can be given. We know that K = pjh, where p is the 

momentum of a particle; and the relation between velocity F and 

momentum p is T7/ T72/ 2X 

1 p == raF/V(l-—F 2 /c 2 ). 

We therefore have 


dv 

Ik “ 


F = cp/J(m 2 c 2 +p 2 ) = chK/^(m 2 c 2 -\-h 2 K 2 ). 
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On integrating we obtain 

v = c^(ra 2 c 2 +A 2 i£ 2 )/A-|-const. 

= c<J(m 2 c 2 +p 2 )/h+ const. 

= TF/Aj-eonst. 

We see that as before hv is equal to the total energy W of the particle, 
the kinetic energy together with energy of the rest mass, and also an 
arbitrary constant. 

The derivation given here of the formula (23) for the frequency is 
based on experiment, or rather it shows that if the equation were not 
satisfied wave mechanics would give an incorrect description of the com¬ 
monest facts. It may bo compared with the derivation from the principle 
of relativity given in §1.2. This derivation shows that the arbitrary 
constant should be zero. 

5. The wave equation for non-stationary phenomena 

The wave equation which we have considered already, namely 

V^+^~(W-V)4>^0, (24) 

is applicable to steady beams of particles each particle having energy If; 
the particle density |</r| 2 does not then vary with time. In the pheno¬ 
mena considered in the last section ) */»| 2 does vary with the time; a more 
general wave equation is therefore required. 

This wave function must be of the first order in tho time, and thus 
linear in the operator d/dt. The reason is that in any form of wave 
motion, if the displacement and velocity of the medium are known, 
the further motion is determinate. In electromagnetic waves the same 
is true of E and II. Wo have seen that the real and imaginary parts 
of t/j are analogous to E and II. Thus if i/j is given at any moment its 
future behaviour should be calculable from the wavo equation. It 
follows that this equation must be linear in d/dt. 

The most general form of the wave equation for a particle moving 
in a field of potential energy V will be made up by superimposing wave 
functions of tho type 

ip = ifi w (x,y,z)e- 2nim l h , (25) 

where tfj w satisfies equation (24). This follows from the assumption 
made on p. 12 about the form of the time factor, and from equation 
(23) for the frequency. The functions which are to be superimposed 
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will have different amplitudes and values of W. It will easily be seen 

that 2niW ± 

dt h 


The equation linear in d/dt, from which W has been eliminated and 
which (25) satisfies, is therefore 

h dip h 2 

2tt i dt 87r 2 m 




(26) 


This then is the general equationf satisfied by the wave function ift. 

We must now use this equation to show that wave mechanics will 
give the same results as classical Newtonian mechanics when the field 
of force in which the electrons move does not vary much in a distance 
comparable with the wave-length. This is almost implicit in what 
has gone before; we have shown in §1.3 that a continuous beam of 
waves under these conditions follows the same path as a beam of 
particles according to classical mechanics, and also that the velocity 
of a wave packet is the classical velocity of the particles. The following 
more formal proof may, however, be of interest. 

If i/j represents a wave group or wave packet, the quantity 

x = J xi/nfj* dxdydz j J dxdydz 

will represent the average ^-coordinate of the electrons, information 
about whose position is given by the wave function i/j . It will be 
sufficient if we show that x , y> z satisfy the classical equations of motion. 
We shall in fact prove that 

= - j^H>*dxdydz, 

which shows that the wave packet follows the Newtonian path as 
closely as its finite size will allow. 

The proof makes use of Green’s theorem in the form 

J (AV*B-BV*A)dx= 0 

if the integral is over all space and A and B tend to zero outside a finite 
region. 

We first multiply equation (26) by xi/j* and integrate over all space; 

f For a discussion of the relation of this equation to other wave equations occurring 
in physics and of the analogy between wave mechanics and physical optics see Condon 
and Morse, Quantum Mechanics , p. 10 (New York, 1929); K. K. Darrow, Rev. Mod . 
Phys. 6, 23 (1934). 
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and then multiply the complex conjugate equation by xip and integrate 
over all space; we then subtract the two equations and obtain 

£3 S = 5S3 J 

The integral on the right can be replaced by 


J* {0*#V 2 0—0V 2 (#0*)}dr+2 j 

We thus obtain, since the first term vanishes by Green’s theorem, 

J* 


( dx 

Yt 


hi 

2rr 


■W dT . 


6* 

dx 


(27) 


If we differentiate further with respect to the time we obtain 


d 2 x 


m —- — 
dt 2 


M f ww* , 

2t t J [dt dx ~ rr dxdt) 

M f /dip 00*_ 00* 00\ ^ 

2rr J \0j~0aT~ 0* 


by partial integration of the second term on the right-hand side. If we 
now substitute for 00/0J and 00*/0£ from (26), we obtain for the right- 
hand side 


h 2 

877%?, 



The first term vanishes by Green’s theorem and the second gives 

“ / Tx H ‘* dT - 

We thus obtain the classical equation of motion in the required form. 

We shall also verify that the equation (26) predicts the conservation 
of matter; since |0| 2 dr is equal to the probability that the particle is 
in the volume element dr, it follows that 


J \Wdr 

integrated over all space must be equal to the number of particles 
described by the wave function; it will be equal to unity if the wave 
function is used to describe a single particle. It should therefore be 
possible to deduce from the wave equation that 

|J|^|«rfr = 0. (28) 

If this were not so, the prediction of the equation would not accord 
with common experience, and it would have to be modified. 
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The proof is as follows: the left-hand side of (28) may be written 

If we substitute for di/s/dt and dtp*/dt from equation (26), we obtain 

(^*V 2 i/r-^V 2 i/r*)dT. (29) 

Integrated over all space this gives zero by Green’s theorem, which is 
what we set out to prove. 

If we integrate over a finite volume (29) is equal to 

J (<A* grad</>—</>grad ij>*) n dS, 

where dS is an element of the surface bounding the volume, the inte¬ 
gration is over the whole surface and the suffix n denotes that the 

component of the vector within the brackets normal to the surface 
should be taken. This suggests that the vector 

Ti it 

-(i/r*grad t/r—grad 0*) (30) 

47 TVYl 

should denote the number of particles crossing unit area per unit time. 
Multiplied by the charge on each particle it would give the current. 

It will easily be verified that if i/j has the form of a plane wave 
u4exp(iKr), (30) is equal to A A *v, where v(~ hKfm) is equal to the 
velocity of the particles. 


In 

4:77711 


6. The uncertainty principle 


In §4 it was shown that if two plane waves of wave-number K, K' 
are superimposed, a series of wave groups is formed, and that the length 
of each group, from one minimum to the next, is equal to the distance 
between adjacent zeros of 


cos 2tt 


(K-K' 


-x— 




2 2 
This distance is equal to \j(K~K'). 

If we analyse a single wave group, the same relation is valid approxi¬ 
mately. If a wave group of length Ax and apparent wave-number K 0 is 
analysed into its Fourier components, that is to say into a series of 
monochromatic waves, the wave numbers of these waves will be found 
clustered about the mean value K 0 in a range of order A K given by 

the relation A 1 /olx 

AxAR ~ 1 . (31) 


AxAK ~ 1. 
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This can be shown most simply as follows: we may represent the wave 
group at any instant of time by 

0 __ (32) 

the exponential function exp(— x 2 /a 2 ) is chosen as a simple and con¬ 
venient function which tends to zero rapidly as x increases to values 
greater than a. Thus Ax ~ 2a. If we analyse this function into plane 
waves it will take the form 

00 

0 = f A(K)e- nlKx dK. (33) 


It may easily be verified by direct integration that if we take 
A(K) = w»aexp[-(JC-JS: 0 )W], 

(32) and (33) are identical. A(K) is thus very small except in a range 

AX given by AK = 2/„. 


Thus Ax AK = 4 / 7 t, which is of order unity. Equation (31) is therefore 
verified. 

It will be seen, therefore, that the shorter the length of a wave group 
the greater is the spread in the values of the wave-number, and thus 
in wave-length or frequency. This is of course a well-known and 
important principle in radio reception where too selective a receiver 
distorts short pulses or musical notes of high pitch, and also in optics 
where the breadth of a spectral line, and thus the spread in frequency, 
depends on the time during which the atom emits the wave and thus 
on the length of the wave train. 

The application of these principles to quantum mechanics is as 
follows: we return to the idealized experiment shown in Fig. 5 in which 
a beam of particles moving with velocity V falls on a screen with a slit 
in it, which can be opened or closed by a shutter; the shutter is opened 
or closed for a short time l 0 , and a cloud of particles of length Ax equal 
to Vt 0 passes through. The density of particles in this group is 
determined by the wave function \jj. Following the principles described 
in the last paragraph, this wave function can be analysed into plane 
waves of wave-number lying in a range AK about that of the original 
wave incident on the screen, where AK ~ 1 jAx. Since a wave of wave- 
number K describes a beam of particles of velocity hKjm , it follows 
that, whereas the beam of particles falling on the screen had a well- 
defined velocity F, the velocities of those that pass through the screen 
have a spread of velocities AV defined by 


AV Ax ~ hjm ; 
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or, in terms of the momentum p, 

Ap Ax ~ h. (34) 

This relation is known as the Uncertainty Principle, and was first 
formulated by Heisenberg, f 

The principle shows that, if wave mechanics is valid, and if it is 
always possible to describe a beam of electrons by a wave function, 
then any attempt to form a beam of electrons of finite and known 
length A#, as for example with slits and shutters, will necessarily disturb 
the momentum p of each particle by an unknown amount, the spread 
A p being given by (34). 

The principle, however, goes further, and leads to a formulation of 
wave mechanics more general than that considered up to this point. 
Let us suppose that at a given instant of time measurements are made 
in any way of the position and of the momentum of a particle, and that 
the accuracy of the measurements are such that its position lies between 
x—\Ax and x-\-^Ax 9 and the momentum between p—\Ap andp-f JAp. 
Then the equations of either Newtonian or of wave mechanics will 
enable the future position of the particle to be predicted, with certain 
uncertainties arising from the inaccuracy of the original measurement. 
The procedure according to wave mechanics will be as follows: we must 
write down a wave-function tp which embodies the knowledge which, 
as a result of measurements, we have obtained about the position and 
momentum of the particle. The wave function will have the form of 
a wave group of length Ax and will be made up of waves with wave 
numbers in the range A p/h. The wave equation (26), being linear in 
the time, will enable the form of ip at any future time to be calculated. 
\ip\ 2 dx will then give the probability that the position of the particle 
at that time lies betw een x and x-\-dx. 

If the accuracy of measurement is such that Ax Ap ~ h the wave 
function will have the form of the wave group written down in 
equation (33). If Ax Ap > h (as is certain to be the case in practice) 
it is possible to imagine a number of these functions superimposed, 
with random phases and values of K 0 distributed over the range AK\ 
the form of the wave function will be analogous to that of a wave pulse 
of white light. But if Ax Ap is much less than h i it is impossible to write 
down a wave function at all which represents the results of the measure¬ 
ment. We must conclude, assuming that wave mechanics will always 
describe correctly the motion of a particle, that it is impossible to 


t W. Heisenberg, Zeita.f . PhysiJc, 43, 172 (1927). 
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determine simultaneously the position and the momentum of a particle 
with errors less than those given by the uncertainty relation (34). This 
limitation, moreover, must not be one which is temporarily imposed 
by the clumsiness of the measuring instruments now available, but 
must lie deep in the nature of things; for were it possible to overcome 
it, the whole edifice of wave mechanics would fall to the ground, and 
a direct contradiction would arise between the deductions made from 
experiments on electron diffraction and the result of our hypothetical 
experiment. 

It is therefore of great interest to examine the hypothetical experi¬ 
ments by which we could determine position and momentum simul¬ 
taneously, and to show that they do in fact yield an uncertainty of 
the predicted amount. The most famous of these demonstrations is 
the ‘Gamma ray microscope’ first discussed by Heisenberg. The argu¬ 
ment put forward is as follows: A beam of electrons is supposed to be 
travelling along the #-axis with known velocity V and momentum p. 
It is desired to observe an electron and to measure its position; for this 
purpose it is imagined that a microscope will be used, and since the 
utmost accuracy is required a short wave-length should be chosen. 
The position can then be determined to an accuracy given by 

Ax — A f/a, (35) 

where a is the aperture, A the wave-length, and / the distance from 
the electron to the lens. 

Radiation cannot be scattered by an electron without disturbing the 
electron; radiation is scattered by free 
electrons according to the rules of the 
Compton effect, according to which the 
momentum lost by the light quantum 
when scattered is transferred to the 
electron. Thus if a quantum of frequency 
r, and hence momentum hv/c is scattered 
through an angle 8 , momentum 

hv(l— cos0)/c 

is transferred to the electron. Thus we 
cannot observe the electron without disturbing it. Moreover, we disturb 
it by an unknown amount, since owing to the finite aperture of the 
lens, 6 is not known exactly. In Fig. 6 , 8 may lie between ABC and 
A BC '. There is thus an uncertainty ajf in 8 , and hence, since 8 ~ 90°, of 

hva/cf 

D 



6072 
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in the momentum transferred to the electron. Since A = cjv this may 

be written A , 

A p = ha If A, 

where A p is the uncertainty in the momentum of the electron after the 
measurement has been made. We see that 


as we expect. 


A p Ax = A, 


7. Wave functions for the momentum of a particle 

It is implicit in what we have said already that if at a given instant 
of time a particle is described by a wave-function i/j(x), deductions can 
be made about the momentum p of the particle as well as about its 
position x . In this section we shall show in detail how this can be done. 
We shall confine ourselves to particles moving in one dimension; the 
generalization to more complex systems is straightforward. 

With regard to the spatial coordinate, wo havo seen that: 

(a) The probability that x lies between the values x ' and x'-\~dx f is 

\t/j(x')\ 2 dx'. 

(b) The mean or most probable value of x is x given by 

x = J x\ifs(x)\ 2 dx. 

We require similar information about the momentum p. We want in 
fact a function g(p) such that \g(p')\ 2 dp' is the probability that p lies 
between the values p' and p ,J rdp', and 

P = j p\g(p)\ 2 dp 

is the mean value of p. 

It is clear from the considerations of the last section that g(p) will 
be equal to the appropriate Fourier component of ip{x). We may thus 

s®tf 

g(p) — A J e~ iprlr, ip(x)dx. (36) 

— CO 

The constant A will be determined by the condition that 

/ \ff(P)\ 2 dp--= !• 

This gives A = 1 /^(2nh), 

The proof is as follows. Consider the integral 

Po 

j I g(p)\ 2 dp- 

—po 

t is written for h/2n. 
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It may be written 

JJJ e ip{x ~ x,) ' ih ifs{x)^{;x') dpdxdx\ 

which, on integrating with respect to p , gives 

JJ sin { --"* ^(a#(*') dxdx'. 

Integrating with respect to x' and making p 0 tend to infinity, and 
making use of the equation 


OC 

/ 


sin# 

x 


dx 


we obtain 


2vK J \i/j(x)\ 2 dx. 

The function exp( — ipxjh )(2 tt^) is known as a ‘transformation 
function’, by means of which we can transform the wave function 
giving information about x into a wave function giving information 
about p. 

The mean value of the momentum p is given by 


V = / p\9{v)?dp. 


(37) 


It is often desirable to express p in terms of the wave-function if*(x) 
for spatial coordinates. We shall now show that 

p =.- ~ f i/j*—dx. 
i J dx 

The proof is as follows: 

The quantity p defined by (37) may be written 

p = JJJ pe ip ( x ~ x ‘y R i/j(x)i/j*(x') dxdx'dp. 

We carry out the ^-integration between the limits writing 

Pq/Ji ~~ jfe 0 , and x—x' = this gives 

p = JJ p»j -f j dxdx’. 

After partial integration with respect to x, we obtain 

P = jj sin &<,(%-z') ^l^'jdxdx’. 

Carry out the integration with respect to x', we find as before 


p=~if rw^dx. 
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This identification of the momentum p with the operator (ft/fyd/dx 
has consequences of importance. It enables one to find the average 
value of other quantities, for instance the energy of a particle. Thus 
in classical mechanics the energy H(p, x) of a particle moving in one 
dimension is given by 

Replacing p by (ft/tyd/dx, we see that the mean value W of the energy is 
given by _ 

W = j ip*H<p dx 

=M-£st.+ f <*>K <38) 

If 0 is the solution of a Schrodinger equation for known energy W, 

^g+OV-FW.-O, (39, 

(38) is then a trivial result, obtainable by multiplying both sides of 

(39) by and integrating over all x. Equation (38) is applicable for 
systems described by any wave functions. 

It will be noticed that if p denotes the operator (hjiydjdx, then 


(px-xp)ifj : 


In some formulations of quantum mechanics*]* the quantum conditions 
are introduced in this way, as 


px—xp 


(40) 


It there appears that for any two dynamical variables / and g for which 

fg-gf = o 

it is possible to measure them both simultaneously with unlimited 
accuracy. Such variables are said to ‘commute’. If, however, fg ^ gf, 
the two quantities cannot be determined simultaneously. 

In finding the mean value of any quantity which contains terms of 
the form pa:, an ambiguity arises, since, if p is replaced by (hli)djdx 9 

pxifs # xp*p. 

j- e.g. P. A. M. Dirac, Quantum Mechanics , 3rd ed. (Oxford, 1947). 
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In such cases we replace px by %(px+xp), and thus px\fj by 



i.6. by v( a S +W )‘ 

More generally, a classical term 


i 8 replaced by ‘*{/| + | W )j. 


With this convention, operators have the ‘Hermitian’ property; this 
is defined as follows: If <f> and 0 are any wave functions vanishing at 
infinity and L(p,x) any real function of p and q, 


j dx = f dx. (41) 

For terms of the type p 2 the proof is obvious; for terms of the typo 
f(x)p we have for the difference of the two sides of (41) 







In the next section it will be shown that the Hermitian property is 
necessary if the predictions from wave mechanics are to be in accord 
with the facts. 


8. Hamiltonian equations of motion 

In the preceding sections a method has been given whereby the 
wave functions of a particle can be set up, which will give information 
about the momentum p and the positional coordinate x . For certain 
purposes it may be necessary to consider more complicated dynamical 
systems, such as rigid bodies (for certain purposes molecules can be so 
treated) or numbers of interacting particles. For such systems we shall 
make use of generalized positional coordinates q l9 q 2 ,---, q n > where n is 
the number of degrees of freedom of the system, and Hamilton’s 
equations of motion. We require a wave function ^(<7 i,? 2 v) 
the coordinates, the interpretation of which will be that 

Wffi.?.»?n)l 2 dq 1 dq i ...dq n 

is the probability that, at the instant of time considered, the coordinate 
q x lies between q x and q x +dq v q 2 between q 2 and q 2 +dq 2i and so on. 
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We require also an equation linear in the time of the type of (26), to 

determine tp at all subsequent times if it is given initially. 

The Hamiltonian equations on which we shall base the theory are 

as follows: A ‘Hamiltonian function’ H(p rt q r ) (r = of the 

positional coordinates q r and momenta p r is set up; the equations of 

motion are then 0 „ TT 

</Pr __ ___ OB dq r _ ^ ffl 

fit dq r 5 dt dp/ 

The Hamiltonian function and momenta p r are defined as follows: 
first the kinetic energy T and potential energy V are written down as 
functions of q r) q r . The Lagrangian function is then defined by 

T—V 

The generalized momentum p r is then defined by 

Pr = ^ ( 43 ) 

and the Hamiltonian function by 

H = ^p r q r —L, 

where the q r are to be eliminated by means of (43). 

Where V does not contain the time explicitly H is equal to the energy 
of the system. Thus for a particle in a field of force 


H “ 2 ^l+Vl+PD+V. 

The generalized Schrodinger equation that will replace (26) is 

I Ob 


where p r is to be understood for (h/i)dj8q r . This is suggested, first 
because it reduces to (26) for the case of a particle moving in a field, and 
also because it leads to the classical Hamiltonian equations of motion 
(42) for the centre of gravity of a wave packet. This will now be proved. 
The mean value p r of p r is given by 

Pr = J *p*P r 'l'dq, 

and thus 

~ — J ( ii/*p r if)+tp*p r ip)dq = J — 

on substituting for ip, ip* from (44) and writing p r = (h/i)dldq r The 
right-hand side gives 
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The first two terms vanish by the Hermitian property (41) and we 
are left with „ 


Vr 


Examples of systems for which a wave equation must he deduced 
from a Hamiltonian function are: 


(a) Motion of a system consisting of two or more particles. Tf their 
spatial coordinates are (x 1 ,y v z l ) and (x 2 ,y 2 ,z 2 ), a wave function 
i/j(x v y v z 1 ; :r 2 , ?/ 2 , z 2 ) wl ^ describe the system, and 


I 2 dx x %i dzi dx 2 dy 2 dz 2 , 

will give the probability that the first particle is at the point (x v y v zf) 
in the volume element dx 1 dy 1 dz 1 and at the same time the second 
particle is at the point (r 2 , ?/„, zf). 

(b) Motion of an electron in a magnetic field. In this case there is no 
potential energy function, and so the definition of II given above cannot 
be used. Nevertheless a Hamiltonian function can be set up so that 
equations (42) are valid. This function is 


1 

2 m 




+ V, 


whero A is the vector potential defined so that the magnetic field H 
satisfios H — curl A. 


That this is the correct Hamiltonian may be verified as follows: 
Hamilton’s equations give 


and 



(45) 


where the suffix r can have the three values x, ?/, z and q x is written for 
x. Substituting in the first of those equations, we find 


Vr = 


e XT' . dA s dV 
c £ dq r dq r 


while the second gives, on differentiating with regard to the time 

e V s dA„ . 
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This, in vector form, may be written 

mv == —grad V -j--[vH], 
c 

which gives the classical equation of motion. 

In most physical applications H may be considered small, and the 
Hamiltonian becomes 

~ (pl+pl+P%)+~ (A x 'p x \-A y p u -\-A z p z )+V, 

from which the wave equation can be written down in the usual way. 
As we saw above, (Ap)i p has to be replaced by 

\ 2 {(A gradi/i)+div(Ai/r)}, 

and if div A vanishes this gives 

^ (A grad 0). 

The Schrodinger equation is thus 

- h , = _A 2 _v 2 .//+ he . (A grad Vifi. 
in 2m ,T m«' b r) r 


(46) 



II 

STATIONARY STATES 

9. The old quantum theory 

It was Niels Bohr in 1913 who first brought forward the hypothesis 
that the energy of an atom is quantized. This hypothesis means that 
the total energy of the electrons in an atom cannot have any arbitrary 
value, but must have one of a series of discrete values, of which one 
is the lowest and is thus the energy of the normal state of the atom. 
Bohr's hypothesis was made in order to explain a series of experimental 
facts, and a theory (the old quantum theory) was built up in accord 
with many of these facts, based on a few assumptions such as the 
quantization of angular momentum. It is, of course, one of the most 
striking successes of Schrbdinger’s equation that the quantization of 
energy follows from it naturally and without the introduction of special 
hypotheses. 

The experimental facts on which the quantization of energy rests are 
widespread and of very different type. We may mention first the fact, 
fundamental to crystallography and indeed to chemistry, that atoms 
and molecules seem to have each a characteristic size.f It is useful to 
think of many atoms and ions, especially those having the ‘rare gas’ 
configuration of electrons such as K + or Cl - , as behaving much like 
elastic spheres when brought into contact with each other. This 
behaviour seems quite incompatible with the picture of the atom that 
we owe to Rutherford, that of a number of electrons moving in the 
field of a positive nucleus; the dimensions of such an atom would 
depend entirely on the speed with which the electrons were moving 
and hence on their energy, and could be very small for large negative 
values of the energy. 

A second piece of evidence of the same type is provided by the specific 
heats of monatomic gases which at constant volume approximate closely 
to I R. This means that the whole of the thermal energy of the molecules 
of the gas consists of translational energy, and that the internal degrees 
of freedom do not make any contribution. It follows that the electrons 
in the molecules cannot normally give or receive energy on collision 
with other molecules, and this in its turn implies that the minimum 
amount of internal energy that a molecule can take up is too large to 

t See, for instance, the tables of ionic radii given by V. M. Goldschmidt {Trans. Fara¬ 
day Soc , 258, 1929) or W. H. and W. L. Bragg, The Crystalline State (London, 1933). 
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be transferred in a collision at ordinary temperatures. This shows 
that the energy must be quantized. 

Since the hypothesis was originally put forward, numerous experi¬ 
ments have been carried out in which molecules of gases have been 
bombarded by electrons and it has been shown that the energy lost 
by a colliding electron must ha ve one of a series of discrete values, 
unless enough energy is lost to ionize the molecule. We may thus take 
it as an established experimental fact that for any gaseous atom or 
molecule of given type there exists a definite state of greatest stability 
(the 'normal state’), in which the atom will normally be found; and 
that if it absorbs energy, either from light, a colliding electron, or in 
any other way, it will absorb an amount equal to one of the terms of 
a definite series. The atom is then left in an 'excited state 5 . The series 
tends to a definite limit, the ionization energy, /. Any amount of 
energy greater than / can be absorbed, an electron being then ejected 
from the atom. 

The energies of an atom in the normal or excited states are usually 

defined with the convention that the 
total energy is zero when one electron 
is removed from the atom and is at rest. 
The energies W 0 of the normal state and 
Wj, of the excited states are then all 
negative. They are shown schematically 
in Fig. 7. The ionization energy is equal 
to —■ W 0 and the first excitation energy, 
that is to say, the smallest energy that 
can be absorbed by an atom in the normal state, is equal to \ W 0 — Wy\. 

Further strong evidence for the existence of stationary states in 
atoms is the fact that the optical emission and absorption spectra 
consist of lines. The mere existence of line spectra proves that the 
energy must be quantized, since we know that when light of frequency 
v receives or gives up its energy to matter, it does so in quanta of 
amount hv . It follows that an atom can only change its energy by one 
of a series of definite values. Further it is found that in the line 
spectrum of a given atom, groups of three lines of frequencies v v r 2 , v z 
such that , 

^1 i ^2 = ^3 

are frequent. In terms of the theory of stationary states, this will mean 
that the three lines correspond to transitions between states W a , W b , W c , 

hv i — W a —W b , hv 2 = W b —W ci hv z = W a —W c . 


-Zero 


-Wi 

-IK 

-w; 


-H' 


Fig. 7. Energy lovols of mi atom. 
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From an examination of the line spectra of atoms it is in fact usually 
possible to determine the relative values of the stationary states, all 
emission lines being ascribed to transitions between a series of states 
as shown in Fig. 7. Absorption lines in a cold gas, where all atoms are 
in the ground state, correspond to transitions between the ground state 
only and the excited states. 

In the line spectrum of atomic hydrogen the frequencies of the lines 
were found experimentally to conform to a simple formula; the fre¬ 
quencies can be expressed as 



On the left we have the wave-number (the reciprocal of the wave¬ 
length); on the right n x and n 2 arc integers and R is a constant, known 
as Rydberg’s constant. It is equal to 

R r-= 109,()77 cm/ 1 

From this fact we can deduce that the hydrogen atom has a series of 
energy levels, and that the energy of the ni\\ level is given by 
W n -= —hcR/n 2 (n ^ 1,2,...), 

subject to the convention already made that the energy should be zero 
when an electron is just removed from the atom. 

The hydrogen atom is a particularly simple structure, consisting as 
it does of a single electron moving in the field of a heavy positively 
charged nucleus. According to Newtonian mechanics the electron ought 
to move round the nucleus in circular or elliptic orbits; the energy 
could have any value from zero to minus infinity. Niels Bohr in his 
original theory assumed that the orbits were in fact Newtonian, but 
that only a restricted series of orbits were possible. Confining ourselves 
to circular orbits, we shall now show what this limitation must be if 
the observed series of energy levels is to be obtained. 

Suppose that an electron with charge e and mass m moves round an 
infinitely heavy nucleus carrying an identical charge of opposite sign. 
If the radius of the orbit is r and the velocity of the electron is v, then 
the electrostatic attraction between them, c 2 /r 2 , will be just balanced 
by the centrifugal force mv 2 jr\ we thus have 

— = mv 2 . (1) 

r 

The potential energy of the system is --e 2 /r, and the kinetic energy 
Jmv 2 ; adding these together we see that the total energy is 

\mv 2 — e 2 lr t 
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which by (1) is equal to — -|e 2 /r. But we know from experiment that 
this is equal to —hcR/n 2 , where n is an integer; it follows that 

1 e 2 _ hcR 

2 r ~~ n? 

or that r -- n 2 e 2 j2hcR. (2) 

Tt follows that the radii of the allowed orbits are proportional to the 
square of an integer. 

Again, from (1), v = (—(3) 

\ m I n 

and multiplying (2) and (3) we obtain for the angular momentum 

me? 

mvr = n. 

(2 mhcR)* 

Now Bohr’s assumption was that the angular momentum should be 
quantized ; and the elementary derivation given here shows that, if it 
is assumed that the electron moves in a circular orbit obeying Newtonian 
mechanics, this can be deduced from known experimental facts. There 
was no a priori reason to quantize the angular momentum in preference 
to any other constant of the motion such as the energy; Bohr’s choice 
of the angular momentum could only be justified by its results. 

Bohr also pointed out that, to within the limits of experimental error 
to which these quantities were known, rne 2 /(2mhcR )* was equal to h/2 it. 
Planck’s constant had the right dimensions and had already been used 
for the quantization of light; there was, however, no reason a priori 
why the multiplying factor should be 1/277 rather than anything else. 

The assumption made by Bohr in building up a system of atomic 
mechanics was thus that the angular momentum 12 of an electron in 
an atom or of any rotating system should be a multiple of ft/27r or ft, 

£2 = nh/27r (n = 1,2,3,...). 

This assumption had a series of striking successes of which a few will 
be enumerated here. 

9.1. Allowance for the rotation of the nucleus about the centre of gravity 

According to the above analysis, the value of the Rydberg constant 
for an infinitely heavy nucleus should be 

R = 277 2 e 4 /mft 3 c. 
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For an electron of mass m revolving round a nucleus of mass J/, it will 
easily be seen that R is given by 

R =■= 2W/m*A 3 c, 

with m* = m j(l 

The change (one part in 2,000) is too small to be detectable, since 
although R is known sufficiently accurately the other const ants m, e , h 
are not. However, a check on the theory can he obtained by observing 
the lines of ionized helium. In ionized helium a single electron moves 
round a nucleus of charge 2e and mass approximately 4 M, if M is the 
mass of the proton. Thus the line spectrum of ionized helium should, 
according to the theory, be given by the formula 


vjc = 4:R n Jn 2 , 



Experimental values of the two Rydberg constants are 
R Uq = 109,722, R n = 109,077, 

enabling a value of m/M equal to 1/1843 to be obtained, in reasonable 
agreement with values obtained from other sources. 

9.2. Structure of atoms other than hydrogen 

Although the old quantum theory was not able to yield exact 
quantitative results in agreement with experiment for atoms more 
complicated than hydrogen, it was able to give an extremely valuable 
qualitative picture of atomic structure. A theory comprehending elliptic 
as well as circular orbits was built up, and the interaction between the 
electrons was taken into account in a qualitative way. Much of the 
old theory, in particular the concept of the inner K , L , and M shells 
in a heavy atom, has been incorporated into the new theory, but since 
the concept of orbits has been completely discarded, we shall in this 
book build up the whole theory of atomic structure on a basis of wave 
mechanics; this will be done in Chapter VI. The very great service 
rendered by the old theory must, however, be stressed. 

9.3, Rotational energy levels of molecules 

A diatomic molecule rotating about its centre of gravity may be 
treated approximately as a rigid body of moment of inertia I, say. 
If its angular velocity is ca, then Bohr's hypothesis gives 
lot = nh/27r (n = 1,2,3,...). 

The kinetic energy is £Ico 2 = n 2 h 2 /8n 2 I. 
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This formula was used with fair success to account for the rotational 
terms in the band spectra of hydrogen (cf. Chap. V) and also to account 
for the drop at low temperatures of the rotational specific heat of 
hydrogen.! The formula has been slightly modified by the wave 
mechanical derivation (Chap. V) according to which the kinetic energy 
of rotation is n (»+ 1 W/8tt 2 I. 


9.4. Vibrational energy 

It was assumed before the introduction of wave mechanics that the 
quantized energy values of any vibrating system with one degree of 
freedom and natural frequency v would be given by the formula 

W nbv, 

it being supposed that a light wave was just a special case of a vibrating 
system, and that the formula valid for light would be valid for other 
systems. This formula was useful in accounting for the vibrational 
terms in the band spectra of molecules, and for the specific heats of 
solids at low temperatures. It is replaced according to wave mechanics 
by the formula jy = (n+ i)/ tv . 


10. The concept of stationary states in wave mechanics 

It can be shown as a quite general consequence of wave mechanics 
that the energy of any particle which moves in a limited space, from 
which it cannot escape, must be quantized. Thus electrons bound in 

atoms, or atoms as a whole vibrating 
about fixed positions in a solid, will 
have quantized energy values, while a 
free electron in an electron beam can 
have any arbitrary energy. This will 
most easily be seen if we consider a 
very idealized case, that of an electron 
shut up in a box with perfectly re¬ 
flecting sides. We need consider motion 
in one dimension only; we thus suppose that the electron moves along 
the #-axis from x = 0 to x = a, and at those two extremities, where 
the walls are, the wave function vanishes. The wave function must then 
have the form ^ = A sin (*/a)r«, 



Fid. 8. Wave functions for electrons 
in a box. 


t Cf. R- II- Fowler, Statistical Mechanics , 2nd ed. p. 82 (Cambridge, 1936). 
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where n is an integer and A a constant. Substituting into the 
Schrodinger equation rf2 , 

we see that W has the series of values 


W = 

ff 7?- 


7T 2 fe 2 n 3 
2ma 2 ' 


(4) 


We thus have a series of solutions 

*p n = A sm(7rnx/a)e~ ilv » ilh 

which represent standing waves in the box. We also see that wave 
functions only exist corresponding to electrons with energies given by 
(4). If an electron could exist in the box with any other energy, it 
would not be possible to describe it by a wave function. Thus, if wave 
mechanics is to bo universally applicable to the behaviour of electrons, 
it follows that the energy of an electron in a box must be quantized. 



It will be noticed that the quantum-number n has the following 
physical meaning; n— 1 is the number of nodes or zeros in the wave 
function between the two extremities. 

Suppose that we now consider an electron shut up in a box bounded 
not by perfectly reflecting sides but by an electrostatic field which 
pushes the electron back when it tries to get out. The potential energy 
of an electron in this field is shown in Fig. 9. Suppose that the electron 
has an arbitrary energy W. The wave function in the neighbourhood 
of the points A and ii, where the 'classical’ electron would try to get 
out, is as described in Chap. I, §3.2; in the regions where the classical 
electron cannot go \jj will decay exponentially, while within the box it 
will oscillate. If we started to draw the wave function from either end, 
taking the solutions that decay exponentially instead of increasing 
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exponentially, the two solutions will not in general join up in the middle; 
only for a discrete series of energy values W n will they do so, and these 
will be the quantized values that the energy of the electron must have. 

An electron in a hydrogen atom is held in a box very much of 

this type. The potential energy of such 
an electron plotted along a line passing 
through the nucleus is shown in the upper 
part of Fig. 10: an electron with the 
energy represented by the horizontal 
line AB can move freely between the 
points A, where it will suffer total re¬ 
flection. The wave function will be as 
shown below; it w r ill oscillate in the region 
between A and B and die away expo¬ 
nentially outside. Clearly, only for a series 
of energies W n will such a solution of the Schrodinger equation be 
obtainable. 



Fig . 10. Field and wave function 
for hydrogen atom. 


10.1. Quantization with several degrees of f reedom 
We return now to the motion of an electron in a box with perfectly 
reflecting walls, but now consider motion in three dimensions. Let the 
boundaries of the box be given by 

0 < x < a, 

0 < y < b, 


0 < z < c, 

and let the wave function vanish on these boundaries. Then the 
Schrodinger equation \y 


vv- 


K 1 


-0 = 0 


has solutions! 0 = A sin 


7 TU.X . 7 TU»y . 77% z 

—i- sin —™ sm—-- 


for the values of the energy given by 


1F = 


7T 2 ft 2 

2m 


K + ^! + 


$ 


We see that the allowed values of the energy are defined by three 
quantum numbers (%,%,%), and that each quantum number (minus 
one) is equal to the number of nodal planes that the wave function 
has perpendicular to the appropriate axis in space. 


t Here and in subsequent work we omit the time factor. 
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It may happen that the wave equation will have two or more solutions 
for one value of the energy. This will be the case, for example, if two 
of the sides of the box are equal. Thus if a is equal to 6, the two wave 
functions with quantum numbers 

K, ?i 3 ), (n 2 , n v n 3 ) (% ^ n 2 ) 

are solutions of the wave equation corresponding to the same value of 
the energy. The solutions in this case are said to be ‘degenerate’. 

It is instructive to consider the energy levels of an electron in a cubic 
box of side a. The ground state (111) is non-degenerate; the next three 
(21.1), (121), (112) all have the same energy, so the energy level has 
a threefold degeneracy. The next level is (122), (212), (221), and the 
next (311), etc., also with threefold degeneracy. After (222) comes (123), 
etc., giving a sixfold degeneracy. 

It will be noticed that when two or more degenerate levels exist, ifi l 
and t/j 2 , then a linear combination of them 

is also a solution of the wave equation. 


10.2. Electron moving in a closed path 

The following idealized problem is instructive, although it does not 
refer to any actual physical phenomenon. 

A particle of mass m is supposed to move with velocity v on the 
circumference of a circle of radius a. We denote by x the distance 
measured along the circumference of the circle from some arbitrary 
point i\ The wave function describing the motion of the particle will 

th6n 1)6 ^ = Ae ±imvxlfi' 


But clearly this function must be single-valued in x ; on going round 
the circumference of the circle we must return to the same value of ip. 


Thus 


mv(2Trr) 1 
h 2 7T 


must be an integer. Thus the velocity is quantized, and has values 

v = nft/mr. 

This may be written in the form 

mvr = nh. 

This shows that the angular momentum, in this particular case, is 
quantized according to Bohr’s condition (cf. §9). 

5072 r 
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11. Wave functions for the simple harmonic oscillatorf 
11.1. The simple harmonic oscillator in one dimension 
One of the simplest and most important examples in the theory of 
stationary states is that of the linear oscillator. A particle of mass M 
is held to a fixed point P by a restoring force equal to — px , where x 
is the displacement from P. According to Newtonian mechanics, it 
will then vibrate about P with arbitrary amplitude and energy, and 
with frequency v given by 

' „ = J-/£. 

2ttaJ M 

Our problem is to find the energies of vibration allowed by wave 
mechanics, and the associated wave functions. We shall find that the 
energies are given by 

K = (n+\)hv (n =1,2,3,...). 

Before? writing down and solving the Schrbdinger equation wc shall obtain 
1 an approximation to the energies by a simple 

2 ^ )X method. The potential energy of tlio particle is 

V Ipx 2 ' ThisisplottedinFig.il. If the energy 

s' of the particle is W, it can move according to 
^ 7 ^ Newtonian mechanics between tho points A 

_^ and B, that is to say, over a distance 2x 0 given 

by 

v ii •«, . ,. i AB : 2J(2W/p) = 2x 0 . 

loo. 11. Potential energy yK 1 0 

of harmonic oscillator. The average kinetic energy may be taken to be 


X 0 

J (W — Ipx 2 ) dxI j ,ix = fl V. 


Thus the average wave-length is h/<J(4MW/2). If n half-waves are fitted into 
the length AB, wo have 

inhl^(4MW/3) - 2V(2 W/p). 

This gives W = 0-94 nhv, 

an answer of the correct order of magnitude. 

We shall now write down the wave equation and obtain the solution. 
The wave equation is 

g+f?or-b«W= o- 


If we introduce a new variable 




t Cf. E. SohrcVdinger, Ann. tier Physik, 79, 489 (1926). 
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the equation becomes 




( 6 ) 


where A = 2Wjhv. We have to find for what values of A solutions of 
(6) exist which decay exponentially to zero as y tends to oo. 

The solution is obtained by making the substitution 

>A = e~ w f (*/)• 

Substituting into (6), we obtain 

If we now express / by a power series 

f=2*ntr, 

n —0 

we obtain the recurrence relation 


0. 


(n+2)(n+l)A n42 = (2»+l-A)i4„. (7) 

It is thus clear that the equation has two independent solutions of the 
for, “ 

y+A 3 y 3 +A 5 y 5 +..., 

and that the coefficient A n can be determined from (7) in either case. 

Two cases now arise; first, if 2n -f 1 — A does not vanish for any integral 
value of n, then / is given by a power series which is clearly divergent. 
For large n 


x n+ 2 


2 

A n n‘ 

For large n , therefore, successive terms have the same ratio to each 
other as those of the expansion of exp(2?/ 2 ). It follows that for large x , 
the function / tends to infinity as exp (2y 2 ), and thus that our wave 
function i/j tends to infinity as exp(| y 2 ). 

If A — 2w+l, (9) 


where n is a positive integer, the series (8) terminates and / is a poly¬ 
nomial. It is only in these cases that our solution ift tends to zero as 
x tends to oo. 

Equation (9) at once gives us the allowed values of the energy 
W n = {n+\)hv (» = 0,1,2,...). 

The corresponding wave functions are 

h(y) = ^o ex P (~iy 2 )> 

Hv) = Ay^v(-b 2 )> 

M) — -4 2 ( 2 2/ 2 — 1 )© x P(—l2/ 3 )- 
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The multiplying constants A n are arbitrary; but it is convenient to 
choose them so that the functions are 'normalized’, i.e. so that 
J \$\ 2 dx = 1. In that case 

Ao^ir-*, A^Wn)*, A 2 =(l/47r)K 

The functions f n (y) are called Hermitian polynomials, and their 
properties are discussed in a number of text-books. | f n is defined by 

/In 

fn&) = 

Some of the wave functions are plotted against x (our original variable) 
in Fig. 12. We plot also which gives the probability that the 
particle will be found in any unit interval. It will be seen that for 
the higher states the particle is more likely to be found towards 
the boundaries of its vibration than at the centre, where its velocity 
is highest. 



Fig. 12. Normalized wave functions of linear oscillator. 


It is worth noting here that, according to wave mechanics, even in 
the state of lowest energy, an oscillator still has some energy, namely 
the ‘zero point’ energy \hv. This is a result characteristic of wave 
mechanics. 

f For instance, Courant and Hilbert, Methoden der mathematischen Physik , Bd. 1 
(Berlin, 1924). Cf. also Appendix to this book (pp. 370 et seq). 
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11.2. The bounded harmonic oscillator 

In the above analysis we imposed the boundary condition that the 
wave-function </r vanishes at infinity. If we replace this condition by 
the one that the wave function vanishes at the walls of an enclosure we 
may obtain the energy levels of a ‘bounded’ harmonic oscillator. This 
problem has been treated by several authors, f It is found that the 
effect of enclosing the oscillator is to displace its energy levels towards 
higher energies, the displacement increasing with decreasing dimensions 
of the enclosure. Thus if we write £ 0 = |Z(4cw//<t 2 ) i , where ifj(x) = 0 at 

# ^ i then ^ jit 

_ Sl > 0 

dt o ’ 

where W n is the energy of the nth state. The formula for W n is compli¬ 
cated but reduces to a simple form for £§ (n —|), when it becomes 


w n = (n-i)ha>+l-Yfiu e ~ Hie ^ 


(q-l)l 


and for £§ n - 


when it reduces to 

w — — 
n “ 4ft 


ha), 


showing that as £ 0 0, W n oo. 


11.3. Simple harmonic oscillator in three dimensions 
Suppose that the restoring force is —PxX for displacements in the 
^-direction, —p 2 y for displacements in the ^-direction, and so on. 
The potential energy V is given by 

V = — l(pi z 2 + 2 > 2 y*+p 3 z 2 ) • 

The Schrodinger equation is separable; if we write 

ip = 

then iff is a solution if 

^+^(W 1 -lp 1 x*)'P 1 ==0, etc. 
and W = W 1 +W 2 +W 3 . 

If v i> v 2 > v z are the frequencies for vibration in the three directions, the 
energy values are 
W 

— = v x (n x + 

t F. C. Auluck, Proc. Nat. Inst. Sci. India , 7, 133, 383 (1941); 8, 147 (1942); F. C. 
Auluck and D. S. Kothari, Science and Culture , 6, 370 (1940); Proc. Camb. Phil. Soc. 41, 
175 (1946); S. Chandrasekhar, Astrophys. J. 97, 268 (1943). 
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12. The wave mechanical treatment of the hydrogen atomf 

In this section we shall treat the motion of an electron in a central 
field of force, and in particular in the Coulomb field of a charged positive 
nucleus. We shall first treat the nucleus as of infinite mass; the cor¬ 
rection for its finite mass will be made in Chapter V. Denoting by r 
the distance of the electron from the nucleus, we thus write V (r) for 
the potential energy of the electron in the field of the nucleus. If the 
electron moves in the field of a positively charged nucleus of charge Ze, 

then V(r) = — Ze 2 /r. 

The Schrodinger equation is 

Vy> + ~(W~V)t=0. ( 10 ) 

In this section we show that, for negative values of W , solutions exist 
which tend exponentially to zero outside the atom if W has one of the 
values 




mZ 2 e 4 


This is the same series of values that was given by Bohr’s original 
theory. Solutions bounded at infinity exist for all positive values of 
W ; these correspond to the ionized state of the atom and will be dis¬ 
cussed in Chapter IX. 

The first step in the solution of equation (10) is to transform it to 
spherical polar coordinates 6 , The equation transforms^ into 


r 2 3r \ 3rj r 2 sm 6 36 \ 36/ 


_i—L_ = o. 

+ r 2 sin 2 e 8<f>^ h* K >V 


This is solved by setting 

0 = i*(r)0(0)<D (<f>), 

where R, 0, $ are functions to be determined. It can easily be verified 
that <j> satisfies (10) if R, 0, O satisfy the following equations 

d 2 0 . 


,+w^> = 0, 


1 d 

sin# dd 




'■(W—V)- 


1 ( 1 + 1 ) 


R = 0, 


t E. Schrodinger, Ann, der PhysiJc, 79, 361 (1926). 

J C. E. Weatherbum, Advanced Vector Analysis , p. 15 (London, 1944). 
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where u 2 and l{l+ 1) are arbitrary constants which have been written 

in this form for reasons which will appear below. 

Now O must be a single-valued function of the azimuthal angle <f>\ 

that is to say, when cf> is increased by 2n or any multiple of 27 r, <1> must 

be unchanged. Thus u in (13) must be an integer, and we have for the 

general solution , . . . _ . 

° d> = A cos B sm u<f), 


where A and B are arbitrary constants. It will be convenient to ascribe 
to u positive and negative integral values and take the general solution 

to be <D A**. 


The equation (14) is the well-known equation of which the solution 
is the associated Legendre polynomial Pf( cos#). If l is integral and 

l > \u\, 

then there exists one solution Pf(cos0) which is bounded in the range 
0 ^ 0 ^ 77. For other values of l no bounded solution exists. 

Values of Pf(cos0) are (we w r rite P £ = P t ) 

P o (cos0) = 1, 

P^cos#) = cos#, P\{ cos0) — sin0, 

P 2 (cos0) = §cos 2 0—-1. 

We thus see that solutions of Schrodinger’s equation can be classified 
conveniently according to the value of l. If l = 0 the solution is a 
function of r only, and thus spherically symmetrical. The solution is 
non-degenerate (except for certain special forms of V(r ), of which the 
Coulomb form is one). The states corresponding to such solutions are 
known as 5-states.f It will be shown below that the angular momentum 
corresponding to such states is zero. 

If l = 1, the function @0 can have either of the three forms 

cos 0 , sin 0 e^, sin 0 


These three independent solutions necessarily correspond to the same 

energy value; therefore any combination of them would give a solution, 

for instance, „ . a , . Q . , 

eos0, smfleos <£, smtfsin<£, 

i.e. z/r, x/r, yjr. 

Thus three independent solutions with l = 1 have each a nodal plane 
passing through the origin. The states corresponding to l = 1 are 
called jp-states. 


t The letters s, p, d, /, now used to describe Btates with l — 0, 1, 2, 3 wore originally 
introduced before the advent of the quantum theory to describe certain series of lines, 
8 stands for sharp, p for principal, d for diffuse, / for fundamental. 
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Similarly the states corresponding l = 2 are called d-states. 

In general there are 2Z+1 different values of u corresponding to a 
given value of l\ the state corresponding to a given value of l shows a 
(2Z+l)-fold degeneracy. 

We turn now to the equation (15) for the radial part of the wave 
function. We shall take V(r) to be either a Coulomb field, or a field 
of the type K(r) _ 


where Z tends to unity as r tends to infinity, and to some larger value Z 0 
as r tends to zero. We shall see in Chapter VI that this is a good 
approximation to the field in a heavy atom with one valence electron, 
such as an alkali atom. 

We first substitute in (15) 


R{r) =/(r)/r; 

we then obtain jp + 0 W—V )—— 0. (10) 


This equation is of the same type as the Schrodinger equation for the 

motion of an electron in one dimension. 
The type of solution can be seen by 
plotting against r the quantity enclosed 
in curly brackets. If W is negative but 
| If | is small enough, it will appear as 
shown in Fig. 13. Where it is positive / 
will oscillate, where it is negative / will 
tend exponentially to zero. The quantum 
state will be defined by the number of 
nodes in the wave function; four are 
shown in the figure. They will appear 
in the wave-function i/j as nodal surfaces 
of spherical form. 

We denote the total number of nodal surfaces in the wave-function 
ift by n— 1, and call n the principal quantum number, n is then equal 
to unity for the ground state, which has no nodes. Since l is the number 
of nodal surfaces passing through the origin, n—l—l is the number 
of nodes in /(r), i.e. the number shown as 4 in Fig. 13. We would 
expect the value of the energy associated with a given quantum state 
to depend primarily on n. Actually for the pure Coulomb field 
(V = —Z 0 e 2 /r), the analysis given below shows that the energy is 
actually independent of l, and depends only on the principal quantum 


z?n /UT L{W ) 



Fig. 13. 





§ 12 WAVE MECHANICAL TREATMENT OF THE HYDROGEN ATOM 57 


number n according to (11). This is not the case for any other field; 
for the field described above the energy with l = 0 is the lowest. The 
scheme of energy levels is shown schematically)* in Fig. 14. 

Hydrogen_ Alkali 


it -2 



7r-2.M.u-1,0.1 
n=2,l°0 


71=1 



7i«l>0 


Fic. 14. Levels for hydrogen and alkali atoms. 


We turn now to the solution of equation (10) for the particular case 
when V is equal to — Ze 2 /r. If we set 


r = ~V( 2m l W\) x 


the equation becomes 


where 




(17) 


For large x, f will behave like and as we require the solution which 


behaves like e~ x , we set 


/ = e~*g(x). 


We obtain, substituting into (17), 

d*g »dg (* W+1)L_ 0 

dx 2 dx x 2 j 

We attempt to find a solution of (18) of the form 


g = & 2 A s x? . 

s = 0 


(18) 


(19) 


For a this gives <7(<x—1) = 1(1+ 1), 

whence o* == i+1 or — l. We require a solution that vanishes at the 
origin since R must remain finite; therefore we take the first solution. 
Substituting (19) into (18) we then obtain the following recurrence 
relation for A s : 

A s [(s+l+l)(s+l)-l(l+l)] = 4.-,[2(«+l)-4 


t In an actual atom tho lowest levels will be forbidden for the valence electron by the 
exclusion principle. 
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If s is very large, 
of s 9 then 


and 2(s+l)—a does not vanish for any integral value 
A slA-i ~ 2/tf. 


The ratio of successive terms in the series is thus the same as the ratio 
of successive terms in the expansion of e lx . Thus g tends to infinity 
as e 2 ^ and /, and hence 0, as e x . 

If, however, 2(s+l)—a vanishes for any integral value of s , the series 
terminates. g(x) is thus a polynomial and f(x) tends exponentially to 
zero. The allowed energy values are therefore given by 
« = 2 (s+l) (* = 1,2,3,...), 


i.e. 


W = 


Z 2 e 4 m 1 


2 #* 


We see that the energies depend only on a single quantum number n 


defined by 


n 


■ s+l. 


If n ■=■ 1, we must have l — 0, if n = 2, l — 0 or l = 1, and so on. 

The wave functions will be written down in a normalized form below. 


n = 1, l = 0, u = 0 

if/ = 7T~t(Z/a)e~~ r l a , 
n = 2, Z = 0, % = 0 

W2 — ~]e~ r/2a , 

4V(27r)\«j \ a) 
n == 2, Z = 1, n = —1,0, +1 


0 = 


^\i r 


.-r/2a 


1 cos^ , . 


4<J(27r)\a) a V2 sin 6 




A very full account of the normalized wave function for hydrogen 
can be obtained from a number of text-books, for example, Pauling and 
Wilson, Introduction to Quantum Mechanics (New York, 1935). 

The problem of the hydrogen atom enclosed in a sphere (the proton 
being at the centre of the sphere) has been treated by Sommerfeld and 
Welkerf who find that when the radius of the sphere is less than 1*835 
times the radius of the first Bohr orbit, the energy of the system is 
positive, so that the electron is exerting a pressure on the walls of the 
sphere. 


12.1. Interpretation of the wave functions 

The wave function for an electron in an atom must be interpreted 
in the same way as for a free electron; \ifj(x,y,z)\ 2 dxdydz gives the 
probability that the electron will be found in the volume element dxdydz 
t A. Sommerfeld and H. Welker, Ann . tier Physik , 32, 56 (1938). 
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at the point (x,y,z). It is here assumed that the wave function is 
normalized, i.e. that 

J z)\ 2 dxdydz = 1. 

An equivalent interpretation is the statement that 

e|#E,?/,z)| 2 

is the charge density in the atom averaged over a time long compared 
with the period of the electronic motion. 

Some further points about the wave functions are worth noting. 
The wave functions of the 5-states are spherically symmetrical; it will 
be shown later that they correspond to states of zero angular momen¬ 
tum. They have a maximum at r — 0, w r hieh shows that the electron 
has a large probability of penetrating into the interior of the atom. 

The wave functions of the p-statos all have a nodal plane through 
the nucleus and R(r) vanishes there; this means that the electron has 
small probability of being found near the nucleus. For d-states, where 
two nodal planes pass through the nucleus and R(r) behaves like r 2 , 
the probability is smaller still. 

12.2. The momentum density functions 

We saw in Chapter 1 that, if the probability that an electron moving 
on a line lies between x' and x'-\-dx' is \i/j(x')\ 2 dx', then the probability 
that its momentum p lies between p' and p'-j-dp' is 

\g(v ')\ 2 dp’> 

where (/(/>) — p j i/>(x)e~ ipxlh dx. (20) 

It is clear that this result may be extended readily to the case of an 
electron moving in three-dimensional space. Thus if 0(r) is its wave 
function in #?/z-space it follows by rejieated application of (20) that its 
wave function in momentum space is 

x(P) = P J <P(r)e~ iip ' t)ln dT, (21) 

where dr denotes a volume element and (p. r) denotes the scalar product 
P»*+PyV+P,*- 

By means of this formula we can evaluate the distribution of momen¬ 
tum in the various states of the hydrogen atom. Thus for the ground 
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Thus, using the result 


/ 


e —ar+«p.f_ 


8rr<x 


(« 2 +/ 3 2 ) 2 ’ 

we obtain for the wave function in momentum space 

x(p) , ? . 


17 (p-pa-p-f' 
so that the momentum density function is 

8 a*h 5 


\X(P )\ 2 


7r 2 (ft 2 +aW 


The density functions for the other states can be evaluated in a similar 
way. Pauling and Podolskyf have calculated the momentum function 
for hydrogen-like atoms, but very little theoretical work has been done 
on momentum distributions in atoms. The momentum distribution of 
electrons in atoms and molecules is of importance in electron scattering 
and will be considered later (Chap. VII). 


12.3. The continuous spectrum 

Solutions of the Schrodinger equation for the hydrogen atom also 
exist for positive values of the energy W. These correspond to the 
ionized state of the atom, and behave at infinity like r~ 1 sin(fer+e). 
They will be discussed further in Chapter IX. 


13. The rigid rotator 


A further problem which can be solved by elementary methods is 
that of finding the rotational energy levels of a diatomic molecule. 
We treat the molecule as a rigid rod, able to rotate about its centre 
of gravity with moment of inertia /. Then if d, <f> are the spherical 
polar coordinates defining its position in space, the Schrodinger 
equation is 



1 d 2 iff 


sin a 0 d<f>\ 


] + Wil* 


= 0 . 


We solve as before by setting 


and obtain 


0 - 

4> = 


and 



0. 


f L. Pauling and B. Podolsky, Phys . Rev. 34, 109 (1929). 
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This has bounded solutions Pf( cos0), if 

W = m(l+l)l2I. (22) 

This differs from the corresponding solution obtained from the old 
quantum theory in that Z 2 is replaced by Z(Z+1). 

The bounded rigid rotator has been considered by Sommerfeld and 
Hartmann, f 


14. Schrodinger’s equation in invariant formj 

The invariant form of the Schrodinger equation may be obtained 
from the kinetic energy by a simple process. Suppose that in terms 
of generalized coordinates q l (i — l,...,?i) the kinetic energy T of a 
dynamical system may be written in the form 


T = % aqtf#. 


then we may consider the quantities to be the metric tensor of a 
coordinate space of n dimensions. Thus in rectangular Cartesian 
coordinates the a { j then represent masses and the a ij the inverses of 
these masses. In this space the invariant form of the Laplacian 
operator is 1 ^ ^ , 


Va 2* dqi \' 


d_ 

a z 


* 


The other invariants in the wave equation can be constructed similarly. 
We can write the Schrodinger equation in the invariant form 



f jL _1 
yla dq k 


^slaa k} 



^ i dt 


0. 


(23) 


14.1. Symmetrical top molecules 

If a molecule has an axis of symmetry this axis will be one of the 
principal axes of inertia. If a molecule (such as 0H 3 C1) has a threefold 
axis the moments of inertia about any three axes rjrj', ££' lying in 
a plane normal to the axis of symmetry are equal. Since the momental 
ellipsoid intersects this plane in an ellipse, and since no ellipse can 
have three equal diameters at angles of 120° unless it degenerates into 
a circle, the curve of intersection must be a circle and the momental 
ellipsoid an ellipsoid of revolution. Thus a molecule of the type with 
a threefold axis behaves dynamically like a symmetrical top. Further¬ 
more, there are molecules of lower symmetry (or no symmetry at all) 

•(■ A. Sommerfeld and H. Hartmann, Ann. der Physik, 37, 333 (1940). 

J Cf. E. Sohrodinger, ibid. 79, 748 (1926). 
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in which it so happens that two of the principal moments of inertia 
are equal; they, too, behave like symmetrical tops* 

For this reason the wave equation for a symmetrical top is of interest 
in quantum mechanics.f 

To describe the motion of a symmetrical top we make use of Euler’s 
angles 0, <£, x • 0 denotes the angle between the 2 -axis in space and the 
axis of symmetry Oz' of the top, <j> and x the angles between the line 
of nodes on the line of intersection of the planes xOy , x'Oy' and the lines 
Ox , Ox' respectively. If A and C denote the moments of inertia of the 
top about Ox' and Oz' respectively, then the kinetic energy T of 
the top is given in terms of the generalized momenta pg, p^, p x by 
the equation 


2T = 


r COS 2 0 1\ 

TshM + Cr* + 


1 2 2COS0 

A sin 2 0 P *~ Jsin 2 # 2 


The Schrodinger equation for this system is then, by equation (23), 
d 2 t/f cos 0 difj . (A cos 2 0 1 \ d 2 ijj 1 


d 2 ifj cos 0 di/j fA cos 2 0 1 \ d 2 ijj 1 d 2 i/j 
dd 2 + iin 0 + \(7 + shT0 + C]d~<p + sin 2 0 ’ 


2cos0 d 2 i/j , 2AW , 
sin 2 0 W<f>+ h 2 


0. (24) 


This equation is separable if we write 


^0(0) sin ( w <£ +wx ), (25) 

cos 

where m and n must be chosen to be integers if ip returns to the same 
value when (f> and x are increased by integral multiples of 2tt. Substi¬ 
tuting from equation (25) into equation (24) we obtain 

-f- cot 0“~-- (m cosec 0—n cot 0) 2 0-fcr© — 0, (26) 

dd 2 d0 


where we have written a = 


2 AW A 


If we make the substitutions 

u — J(l-4-cos0), 0 = u^ m+nl {u- 

then equation (26) can be brought to the form 

. ..d 2 ?7 f . Q .. ,dU 




■ ocfiU = 0, 


■f F. Reiohe, Zeits. f. Physik , 39, 444 (1926); R. de L. Kronig and I. I. Rabi, Phya. 
Bev. 29, 262 (1927); D. Ml Dennison, ibid. 28, 318, 891 (1928). 
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2a = |w+w|+|m—»| + 1+(l+4cr+4w 2 )l, 

2/J = ]w+w|-f \m— »|4-1—(l+4a+4?i 2 )*, (29) 

y — |w-(-w|4-l. 

It follows immediately from the definitions (29) and the fact that m 
and n are integers that y—1 is a positive integer and that 

a+j8—y ^ 0. 

Originally 6 lay in the interval 0 < 8 <C tt so that 0 < u < 1; if U is 

to be finite in this interval JS must bo zero or a negative integer. This 

will only be so if , /on . 

J ^=J>0+l)-» 2 , (30) 


where 

j r= |{|m+»|+|w—»|}, |(|m+w|4-|»rt~n|}4-l, 


as is easily seen from (29). Hence j must be zero or a positive integer. 
Eliminating a from equations (27) and (3ft) we have finally for the 


energy levels 


">• H ; ' ; , ' > +(s-‘,k 


which is Dennison’s formula for the energy levels of the rotational 
motion of simple molecules. This formula can also be derived in a 
semi-classical way.f 

The wave mechanical theory of a symmetric top which in addition 
to the usual three degrees of rotational freedom has also a degreo of 
torsional freedom between two of its principal parts is of interest, since 
certain molecules, the simplest of which probably are ethylene (C 2 H 4 ) 
and ethane (C 2 H 6 ), are thought to behave in this manner. In this case 
two angles fa and fa are required in place of the <f> above; fa denotes 
the angle between the line of nodes and an z-axis fixed in the lower 
part of the top, and fa is similarly defined for an axis fixed in the upper 
part. The angle of twist is then fa—fa and the potential energy, which 
was zero in the case of the rigid top, is now of the form 

F = L{1—cosm(fa—fa)}. 

The energy levels for this system have been investigated by Nielsen, J 
to whose paper the reader is referred for further details. 

f See, for instance, <J. Horzberg, Infra lied and Raman Spectra of Polyatomic Molecules 
(New York, 1945), pp. 24-6. 

t H. A. Nielsen, Phys. Rev. 40, 445 (1932). 
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15. The orthogonal property 

This property is of great importance for the further development of 
wave mechanics. It will be explained first for particles moving in one 
dimension (along the #-axis). It will be supposed that the particle is 
described by a wave-function ip(x) satisfying the Schrodinger equation 

<» 

It will be supposed that a set of solutions exists either satisfying the 
boundary conditions that ip tends to zero as x tends to ± 00 , or that 
ip vanishes at the extremities of some range of x. (An alternative 
boundary condition which is of importance in Chapter VIII is that dtpjdx 
vanishes at the extremities of some range of x.) Then the orthogonal 
property states that, if ip l9 tp 2 are solutions of (1) corresponding to 
different characteristic values of W , W v and W z , 

J 01 02 ^ === (^) 

the integration being over the range of x considered. 

The proof is as follows: from (1) we have the Schrodinger equation 

- 0 

and therefore + — (W 2 — V)tp£ = 0. 

Multiplying the first equation by ip * and the second by </q and sub¬ 
tracting, we have 

= (3) 

If we integrate from x x to x 2 the second term gives 



The boundary conditions for ip are that either tp or dtpjdx should vanish; 
the term in square brackets therefore vanishes and it follows that 

J ip$ip t dx = 0. 

If the particle moves in three dimensions, we have 

vVi+5w- f ^ = 0 
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with a corresponding equation for ip 2 . We obtain as before 

«vv,-^vv? +‘^(W 1 -w 2 )M* = o. 

On integrating over the space in which the electron moves, the first 



where the variable n denotes the normal component of the vector and 
dS an element of the surface bounding the space. Since either i/r or 
dift/tin vanishes at all points on this surface, it follows that 

JJJ ^*^1 dxdydz = 0. 

If the system contains degenerate states, i.e. a number of states tp v 
ifs s which have all the same energy, it is always possible to obtain 
linear combinations of these, 

‘PI’ = 2 c ss 'As. 

8 

which are orthogonal to each other. Thus all the wave functions of 
the system which correspond to bound stales may be considered 
orthogonal to each other. 


15.1. Expansion of an arbitrary f unction in terms of leave functions 
For the discussions of this section it will be convenient to consider 
only systems for which all the energy values are quantized. This will 
be the case for a particle in a box with rigid reflecting walls (Chap. IT) 
or for a simple harmonic oscillator. It will not be the case for the 
hydrogen atom, unless we suppose the atom to be shut up in a largo 
box with rigid reflecting walls. 

Suppose then a particle moves in a field of potential V (; x , y , z) and that 
the quantized energy-values W n and corresponding wave-functions i/j n 
are given by 2 

+ = 0 , 


where i p n satisfies certain boundary conditions (i.e. that ift n should tend 
to zero as r tends to infinity). Suppose that f(x,y,z) is any arbitrary 
continuous function satisfying the same boundary conditions. Then it 
may be shown that f{x , y, z) can be expanded in the form 

f(x, y,z) = 2 z )- ( 4 ) 

n 

For a proof the reader is referred to Courant and Hilbert, loc. cit. 

The expansion is clearly a generalization of Fourier’s theorem, the 

5072 t* 
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functions y, z) taking the place of the sines and cosines in a Fourier 
expansion. The coefficients can be found in the same way as Fourier 
coefficients. Using the orthogonal relation 

j 'I'S'l’n-d* = 0 ( n ¥= n’) 

and assuming the wave functions to be normalized, so that 

J = 1 > 

then if both sides of (4) are multiplied by i/t* and an integration is 
carried out over all space, we find 

A n =-.jtt*dT. 

The expansion (4) has an important physical interpretation. Sup¬ 
posing f(x, y, z) is a wave function describing the state of a particle at 
a given instant of time. Then if the expansion (4) is made, the coeffi¬ 
cients A n will have the following physical significance; A n is the 
probability that if the energy of the particle were determined, it would 
be equal to W u . 

The assumption that the coefficients A n can be interpreted in this 
way is a new one, and is not inherent in the arguments developed up 
to this point. It is similar to the assumption made in Chapter I, that 
the Fourier component of a wave function gives information about the 
momentum of a particle. 

It will easily be verified that 

2I^J 2 =i- 


16. Transformation functions 


In the last section we showed that if a particle is described by a wave- 


function/(;r), then 


| 


gives the probability that particle has energy W n . In section 7 it was 
shown that 


2 J e~ ipxlh fdoc 


dp 


gives the probability that the momentum lies between p and p~{-dp. 
The functions ^ h _ hipxin 


are known as ‘transformation functions’; they can be used to obtain 
information about the energy or the momentum from a wave function 
giving information about the position of a particle. In this section this 
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procedure will lx? generalized, and a method will be given whereby 
information about any dynamical variable can be obtained. 

Let the dynamical variable under consideration be denoted by 
L{p f x), a function of the momentum and spatial coordinates. Besides 
the momentum p and energy p 2 j2m-\-V(x), the most important variable 
that we shall have to consider is the angular momentum 


yp z ~zp y , zp x -xp s , X py~ypx' 

In this section, however, L will be assumed to be a quite general 
function of p and x. 

If, as in §7, we replace p by the operator ( ?iji)d/dx , then L can be 
regarded as an operator. We may thus set up a differential equation 



-jy 


(id*) 


0 , 


which, if (Ji\x) is subject to appropriate boundary conditions, w ill define 
a set of characteristic functions g L which satisfy the boundary condi¬ 
tions if and only if L r has one of a series of characteristic values. For 
example, if L is the energy 

L = _p 2 /2m+ V, 

(5) is the usual Schrodinger equation. Tf L is the momentum />, the 
equation takes the form . . 


with solutions g — Ae il/xlh . 

It will now be shown that the functions g L (x), the characteristic 
solutions of (5), are the required transformation functions; in other 
words that, if a particle is in a state described by a wave function 

I J 9 lAx)^{x) dx |“ 

is the probability that the variable L has the value L'. 

First of all, let us consider g L \x) as a wave function. Then, if a 
particle is in a state described by a wave function g L \x), the average 
value of L is r ~ n 

J yiA x )L\^ --,xjg L .(x)dx. 

If the wave functions are normalized, this by (5) is equal to L'. Further, 
making use of (5) we can easily show that 

J gt.{x)[L-LJg T ,{x)dx = 0, 

where L as before denotes the operator L ^- 7 d- , x) . It follows that if 
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the particle is in a state described by g /y (x ), then the value of L is 
certainly L\ The functions g L > stand in the same relation to the variable 
L as the wave-functions ifj n (x) do to the energy \jrjm-\-V. 

Suppose now that the particle is in a state described by any wave- 
function We can, as in the last section, expand i/j in a series 

ifj{x) = 2 A L .g J: {x). 

As before, |A ]y \ l will give the probability that the variable L has the 
value L'. But, by the orthogonal property of the functions g jj9 

a l- = J i(x)gi-(x)dx. 

This shows that the functions g L >(x) are the required transformation 
functions. 

16.1. Formula tion of problems in momentum space 

A useful method of solution of quantum mechanical problems consists 
in a transformation of the Schrodinger equation to momentum space. 
This method has particular advantages when applied to the solution 
of problems in theoretical nuclear physics.f The transformation of the 
Schrodinger equation to momentum space is equivalent to applying 
the method of Fourier transforms to the solution of the equation. In 
this section an outline of the method will be given; for a rigorous 
mathematical theory of the representation of the Schrodinger equation 
in momentum space reference should be made to a paper by Gordon.^ 
For simplicity we shall consider a system with only one degree of 
freedom, the extension to higher degrees of freedom being purely 
formal. If we denote the coordinate and momentum of the system by 
x , p respectively, the Schrodinger equation may be written in the form 

g+F(*)}#=rf 

According to the theory we have developed above the momentum p is 
interpreted as a differential operator (hj^d/dx and the potential energy 
V(x) acts on the wave function i/j as an ordinary multiplier. It is easily 
shown by integration by parts that 

00 oo 

j e ipxlh 0 (J'X = p 2 j dx, 

— oo —oo 

t Nils Svartholm, The Binding Energies of the Lightest Atomic Nuclei (thesis, Lund, 
945). 

X W. Gordon, Skand. Matem. Kongr, Stockholm , 249 (1934). 



TRANSFORMATION FUNCTIONS 


69 


§ 16 

so that when the problem is transposed from the ordinary space x to 
the momentum space p by means of the transformation 

00 

Xlp) - 1 / e il)J 'l n i/j(x) dx 

— co 

we may write the Schrodingor equation in the form 

— 00 

By the Fourier inversion theorem 

OO 

«/>(*) = ^ J <’- ijJJll ‘x(p')<{p\ 

— 00 

so that 

00 OO OO 

^ J V(x)e^ ,jrl,i tlf(x)dx = ~ j x(P ) ( fp' j V(x)(! i(l) - J,,)xlh dx. 

— 00 —CO —CO 

CO 

Writing v(p—p') =. ~ J F dx, 

— 00 
oo 

wo obtain (^—w)xO>) + J v{p—p')x(p')dp' — 0. 

— 00 

The solutions of this equation differ according as If is positive or 
negative. If W is negative and we write 

<f>(p) = (i^ a —2mTF)*x(p), 

K(p,p') — --2m(p 2 — 2raJf)“ i (p /2 —2mlF-j|/); 
we then obtain the homogeneous integral equation 

CO 

<f>{p) -= J K{p,p')<f>(p')dp'. 

• oo 

Approximate solutions of this integral equation may be constructed by 
means of the Gauss-Hilbert variational principle and the method of 
iterated functions due to Kellog.f 

In certain cases, however, it is possible to obtain exact solutions of 
this equation. For example if 

V(x) - V" ,,r|/ *> 

t O. D. Kellog, Math. Ann. 86, 14 (1922). 
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f = ^» ah 

J n 2 +(p~P ) 2 


so that if we write p% = — 2m W, A — — 2IJ, m/a 2 the integral equation 
for the wave-function y(_p) reduces to 


00 

(2> s +?>(,) 2 x(/') = ^ f 


iP+iF??** 10 *'- 


If we assume a solution of this equation of the form 


*»> “ 2^-fcji- 


then by direct substitution and use of the result 


a 2 +(p—p')- p' 2 +v 2 


p 2 +(a+v) 2 


we obtain the relation 


(an i-po) 2 ~pi 
p 2 +(an+p?,) 


3 fj -”2'~ 


p 2 \(an+pi)‘ 


Both sides of this equation represent the same function if the coeflicicnts 
c n satisfy the relations 

_ Au 2 

"'(an+pof-pi 0 ' 1 ' 1 ’ 

co 

2 C'n = 0 . 

n — 0 

Substituting from the former of these equations into the latter we 
obtain the relation 


C + C T -t A) - __ 1 __ = 0 

Co+C °Z nl ■(] + 2pJa)...(n+2p 0 la) ’ 

which may be written in the form 

A- p#/ V 2po/ „(2VA) - 0, 

J n (z) denoting the Bessel function of order n. In this transcendental 
equation A and a are known, so the equation may be used to find p Qi 
i.e. the possible values of the energy, W , of the system. Once p 0 has 
been determined in this way the wave function x(p) can obtained 
as an infinite series. 
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17. The elements of perturbation theory 
17.1. Perturbation theory for non-degenerate states 
In this section, following the considerations of § 8, we shall write the 
symbol 11 for the operator 

H ■= _/_7 2 +F. 


If the solutions of a Schrodinger equation 


(H—W)i/j — 0 (6) 

are known, together with the corresponding discrete energy values W n , 
then it is possible to calculate the change in the energy values that 
results if H is changed by a small amount U . The term IJ is called the 
perturbation, and the analysis by which this is done is known as 
perturbation theory. Examples are: the change in the energy of the 
stationary states due to an electric or magnetic field, the interaction 
between the electrons in a two-electron atom, and so on. 

The first-order perturbation may be obtained as follows: let i/j n and 
lV n be the wave function and energy value of a non-degenerate solution 
of equation (6). The perturbed equation is 

(H+U—Wji/j = 0. (7) 

Let us write , f = W n +w, 

<A = </'«+/• 

Then retaining in (7) only small quantities of the first order in IJ, w , 
and / we have _ W„)f-\ ( U-w - 0. (8) 


Now let us expand / in the form 

f-IAAn- 

Equation (8) becomes 

lAJW^-W^.+ eU-^ - o. 

n‘ 


If we multiply by 0* and integrate over all spatial coordinates q we 
obtain r T7 , , 

™ = J K U<l>n d, h (9) 


which is the required result. The coefficient A u * in the perturbed wave 
function can be obtained in the same way: 


jKUfndq 
W n -W« ‘ 


( 10 ) 


As an example, we take an electron moving in a central field with 
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potential energy function Ze 2 /r, and try the effect on the ground state 
of changing Z to Z+l. Then 

U = —e 2 jr 
00 . 00 

and w = j e~ 2 Zr l a — r 2 dr j e -2Zrin r z ^ 

o ' o 

where a = h 2 jmZe 2 . 

TtlZc^ 

On evaluating this gives w = ------, 

n 2 

which may be compared with the exact value 

It will be noticed that formula (9) predicts a big change in the energy 
if |01 2 is large in regions where the perturbing field is large. Thus, for 
example, a change in the field near the nucleus of an atom will lead 
to a much larger change in the energy of the 5-states than the p-states, 
for which 0 vanishes at the nucleus. 


17.2. Matrix elements 


The quantity defined by 

{n'\V\n") = | $*.U'p n .dq 

is termed the 'matrix element’ of U with respect to the two states n\ 
n". It occurs in (10) for the first time in this book. 

From any dynamical variable L(p, q) and any set of transformation 
functions g n {q) a matrix may be formed, of which the element n , n' is 


J gt(q)L(t |p qjg n (g) <h- 


For the Hermitian operators which occur in quantum mechanics, it 
follows that corn pl ex conjugate of (n'\U\ri), 


and also that the diagonal elements are real. 


17.3. Perturbation theory for degenerate states 

Let us suppose that the unperturbed system satisfying equation (6) 
has states 0 a , 0 6 with the same energy. Then the perturbation will in 
genera] split the degenerate energy value into two with energies W-\-w l9 
W-\~w 2 and wave-functions 0i+/ 1? 0 2 +/ 2 where 

<I<1 = 

l/l 2 = A 2 tfl a -\-B 2 tls b . 


( 11 ) 
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In practice there are two ways of finding the coefficients A v B L , etc., 
and the energies w v w 2 . As before, if we insert i/q into the wave equation 

we have (U-w)^+ ( H-W )/j = 0. 

Since i p lf 0 2 are orthogonal, and J 0 2 ^/i vanishes, 

f ifs 2 Ufadq = 0. (12) 

Thus, if we can choose ift v by inspection such that ip v iff 2 are orthogonal 

and (12) is satisfied, then the energies can be obtained from 

w i = J 0 1 f /( Ai M’2 = J >A* Ui/i a dq. 

Alternatively, inserting (11) into (8), multiplying by i/j u , and integrating 
over all , we find AA , L _ w)+Bt ^ = 

Similarly A 1 U^+B^U^—w) = 0. 

Eliminating A lf B x and remembering that U nb — U* a , we obtain the 
following quadratic equation for w v w 2 : 

{U aa -w)(U bh -w) = \U ah \\ (13) 

18. The Rayleigh-Schrodinger method 
18.1. Unperturbed system non-degenerate 

In the preceding analysis we retained only small quantities of the 
first order in U ) w, f. In some problems, however, it happens that the 

matrix element r 

(n\U\n)^ J KUKdr 

vanishes, so it is necessary to examine terms of higher order. Suppose, 
for instance, that it is possible to expand the perturbation U in the 

form U = AH 1 +\W 2 +\*H 3 +..., (14) 

where A is a parameter, and that i p n , W n are the wave function and 
energy value of a non-degenerate solution of equation (6); then we may 
assume that the solution of equation (7) may be written in the form 

*P — ^n+^n+A 2 X„+A 3 0 n +... (15) 

with corresponding energy value 

W = W n +\w n +\*m n +.... (16) 

Substituting from equations (14), (15), and (16) into equation (7) and 
rearranging in powers of A we obtain 

(H-W n )<p n +\[(H 1 -w n )4> n +(H-WM+ 

+H(H i -^ n +(H 1 -w n )4 >n +(H-W n ) Xn ]+... = 0. 
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The term devoid of A is identically equal to zero by equation (6). 
Equating to zero the coefficients of A we obtain the set of equations 

= 0 ’ (17) 

= 0, (18) 


If we expand <f> n in the form 

4>n = 2 ««<•</';> 

i 

then {II W n )<f> n - 2 aJM-WMf, 

i 

multiplying both sides of this equation by ifs*, integrating, and making 
use of the equation r 

we can obtain the result J tfi*{H—W, l )<f> n dr — 0, so that, from equation 
(17), , 

J dr = 0. 

From this it follows that w n — (?i\Il^n). (19) 

Furthermore, 

J ^(H~W n )<f> n dr = 2 a,„ f ipf(H- dr 
= *M-W n ) 

= — J 4 , *{H 1 —w n )if> n dT 

= -um »), 


giving 


and we have 


m k \n) (j . 


W—W 

fr ) rr n 




U\Hi\n) 

W i ~W n ' i 


the dash on the summation sign denoting that the term j = n is 
omitted. 

Substituting from equations (19) and (20) into equation (18), we 
obtain the equation 

(11,-^)^+ 2 ' ^^{lA~(n\fA\n)}4>j+(IJ-WJ Xu = 0 . ( 21 ) 

If we write Xn = 2 Ki </',•> 

i 

then it can easily be shown that 

j r n (H-W n ) Xn dr = 0, 
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80 that, multiplying equation (21) throughout by xji* and integrating, we 

obtain „ — u\ i V' (il ff iW( w l^ 4 li) 


-~ r (« |^ 2 I W )+ 


W —W 

rr n rtj 


Similarly, multiplying both sides of (21) by </i ( * (l ^ n), we obtain 

_ (fi^,i»), v> ( immnM 

Generally only these earlier terms of tlie expansion are used in actual 
calculations; the higher terms become increasingly complicated. The 
final values for the perturbed wave function and energy value are thus 

liV 


4> >/ , «+ a 2 




i X2 v' imi») , v' mmm\n) 
+ 4 \X-ur 4 (H;-H'})(n;,-wi) 


<MO(A 3 ) (24) 


IF - W;+A(»|// 1 |»)+A*j(*|fl r ,|»)+ y' ( - = W1 ^ ( ^)- 

l V ^ "i ' (25) 

In cases for which the elementary method fails, i.e. (?i\U\n) — 0, wo 

have \(n\lI 1 \n)+X\n\U i \n)+... = 0, 

so that tho formula for IF reduces to 


IF = W„ 


+»r- 




w.-w, 


*i+r 


( n\U\j)(j\U\n) 
K-Wj * 


This method of obtaining an expression for the energy of the per¬ 
turbed system was devised by Selirbdinger;f it is the natural extension 
to wave mechanics of Rayleigh’s method J of approximating to the 
modes of vibration of a continuous solid. The procedure adopted above 
is also analogous to the methods of Newcomb and Linstedt§ for the 
solution of the equations of dynamics in terms of trigonometric series. 


18.2. Degenerate systems 

When the unperturbed system is degenerate the procedure is the 
same in principle. Suppose, for instance, that there are k independent 

f E. Sohrodingor, Ann. dvr Phynik , 80, 437 (192(5). 

% Lord Rayleigh, The Theory of Sound, vol. i, }). 113 (1894). 

§ H. Poincare, Mithodes nouvelles de la mecanique c dlente. 
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wave functions of equation (6) corresponding to the energy value W n . 
Denote these wave functions by 

W, $?>, #? ) ,-> 

and suppose that they are normalized and orthogonal to each other and 
to all other eigenfunctions. Then the set of h wave functions 

•Pn, m = I (27) 

3= 1 

(m = 1,2,...,/;) correspond to the energy W n and satisfy the ortho¬ 
gonality and normalization property 

J* t l J n,m l l f 7i,p = $ m p> 

provided the coefficients a mj are chosen to satisfy the relation 

2 a mj a pj "Kip- ( 2 ^) 

i=i 

If we now assume a solution of equation (7) in the form 
then as before we have, on equating the coefficient of A to zero, 

(Hi~W n )^n,m+( H — W n)<t> n> m = °* ( 2!) ) 

If we write as a series of the wave functions, say 

$n t m = 2 (30) 

and replace m by the expansion (27), then we obtain the equation 

2 «Wl(<|flil j)- & u w n) ^ 0 (* = l > (31) 

J--~ i 

by a procedure similar to that employed in the non-degenerate case. 
In equation (31) (i\Hy\j) now denotes the matrix element 

am j) - / 

The set of equations (31) determines the constants a mJ . To determine 
the k values of the energy into which W n is split by the perturbation 
we eliminate the a mj from equation (31) to obtain the determinantal 
equation 

(l|//,|l)-w, m (11/7,12) . . . (ll//#) =0. 

(2|/*,|1) (2|/7,|2 )-w n . . . (2|ff,| k) 


mw mm*) ■ • • (k\im-w n \ (32) 

The determinant on the left-hand side of this equation has, of course, 
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k rows and k columns. In general the equation will have k distinct 
roots. Corresponding to each of these roots w m , there will be a set of 
coefficients a mj determined from the equations (31). These coefficients 
will, in turn define a unique wave-function i/f nm defined by equation 
(27). The perturbation will then, in general, split up the energy W n 
into k separate energy levels W n -\-w m (m = 1,2,..., k) and will determine 
a unique set of wave-functions \fs v m . Once these t/y m are known the 
coefficients oc mj in the expansion (30) for the wave function <f> n m can 
be determined as in the non-degenerate case. 

18.3. A modified form of the Schrodmyer theory 

There are many problems in quantum mechanics in which the 
solutions of the unperturbed Schrodinger equation for the higher values 
of W are not known with any accuracy. We shall see later (§§25,33) 
that the wave functions of the helium atom and the hydrogen molecule 
are known accurately for only the very lowest energy states. If, how¬ 
ever, we wished to determine the wave function of such a system 
perturbed from its normal state by means of equation (24) above, we 
would require to know all the eigenfunctions, not only the one which 
is perturbed. It is, however, possible to increase the accuracy of such 
calculations by transforming the series on the right-hand side of equa¬ 
tion (24) to a form which is more rapidly convergent. 

We may write 

= 2 (j\ H l\ n )'Pj = 2 ' UmnWi+Wnh, 

j j 

so that 


(%- *»>*» + y = y (_L + _ 3 \uj\ih 

k + y W n(w n -w s y i w n {K-w.j)) w 1 

y (i^>) 

z w„-wj y 


Thus equation (20) can be put in the form 

jl _ (Hy—w n )xi> n v' ma\ H i\ n ) 

K - K~ + i { > 

Substituting from equations (19) and (33) into equation (18) we obtain 


+ (H-W n ) Xn = 0. 
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Multiplying both sides of this equation by 0* and making use of the 
result 

J -m - «>„)dq - (n |//f |») - (»| // x |n)* 
we obtain finally 

VHJ(n|// l |i)(i|//,|n) , , 

r ' wm-Wj) ' 

The integral (n\Jl\\n) is as easy to evaluate as (n\H x \n) and the series 
usually converges much more rapidly than those involved in (25) and 
(26). In the case in which (n\U\n) — 0 we get for the energy of the 
perturbed system 

{n\U 2 \n) V' w ^ n W\j)(jW\n) 

w n WM~Wi) “ 


w = w„ 


(35) 


18.4. Lennar(l-J ones' s form of perturbation theory 
A method of obtaining the results of the Schrodinger perturbation 
theory which lias the advantage of indicating exactly what is neglected 
in the above analysis and also of dispensing with the assumption that 
the perturbation is small has been devised by Lennaid-Jones.f 

If we assume that the solutions 0„ of equation (6) corresponding to 
an energy-value W n are non-degenerate, then we may write the solution 
of equation (7) in the form 

<A ^ (36) 

i 

then V<f> = 2 «,• Ufa = 2 a t 2 (j I U | j)^. (37) 

i i j 

Substituting from equations (36) and (37) into equation (7) we have 

2 aM -«')*,+ 2 = 0, 

i i.j 

and equating coefficients of 0 t - we obtain 

a i (W-W i ) = 2a j (j\U\i), (38) 

i 

(j — 1,2,...), the series on the right-hand side having an infinite number 
of terms. The set of equations (38) determine the coefficients a f and 
the possible values of W. To determine the possible values of W we 
eliminate the a 's from this system of equations to obtain the detormi- 
nantal equation 


(1|*7|1)-(*F-H5) (11^12) 

(2|C7|1) (2|17|2)—(IF—TO 



f J. E. Lonnard-Jones, Proc. Roy. Soc. A, 129, 604 (1930), 


(39) 
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the determinant on the left-hand side having an infinite number of 
rows and columns. The values of W v and the wave-functions 

being known, the matrix elements (i\U\j) may be calculated by 
quadratures. When equation (30) has been solved for W the wave 
function may be obtained from equation (3b), after the coefficients n i 
have been found from (38) or from the expansion of a determinant of 
the type 





02 

i/j t ... 

(2|f7|l) 

(2|E7|2)-(H-TF,) . 

(2|E7|i) . . . 

(i|f/|l) 

(». i r/ [ 2) 

. (i|l7|i)-(W r -H5) . . . 


the values of W already found being inserted. 

For definiteness, suppose we wish to determine the perturbation of 
W v Then as a first approximation to the solution of equation (39) we 
may neglect all the matrix elements (i\U\j ) except (11?/|1) and obtain 


w - W, + [l\U\l) 


in agreement with equation (19). To obtain a second approximation 
we neglect all those matrix elements (i\U\j) which do not lie in the 
first row or the first column of the determinant on the left side of 
equation (39). In this way we get the equation 


(1|£/|1)— (IT— HJ) (l|r|2) (1|E7|3) . . . 

(2|[7|1) (2\U\2)-(W-W 2 ) 0 . . . 

(3|*7|1) 0 (3|ff|3)—(IF—»F # ) • • . 


= 0 . 


Expanding this bordered determinant we obtain 
{{\\U\\)-(W-WM-W){W 3 -W)...- 

i j 

by means of a well known theorem on determinants. Dividing through¬ 
out by the infinite product (W 2 —W)(W 3 —W)... and rearranging the 
terms, we obtain as our second approximation 

(l|C7|j)(j|*7|l) 


JF = JFH-(l|E7|l)+2' 


w-w 


(40) 


in agreement with equation (25) except that now W appears in place 
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of W x in each denominator on the right-hand side. Equation (40) must 
be solved by the method of successive approximations, 

W^W 1 +(l\U\l) 

being taken as a first approximation. 

The theory for degenerate states can be developed in precisely the 
same way. 


18.5. Rotating polar molecules in an electric field 
As an example of the application of the second-order perturbation 
theory to a particular problem, we consider the solution of the 
Schrodinger equation for rotating molecules subject to an electric field 
of strength F —a problem of some interest in the theory of dielectrics.! 
If we treat the molecule as a rigid body symmetrical about an axis we 
may then write the wave equation in the formj 





~ 0 , 


where I is the moment of inertia of the molecule about its axis of 
symmetry and p is the permanent electric moment of the molecule. 
We saw previously that the unperturbed wave equation (F = 0) has 
bounded solutions ^ u) = ^ i?(cos 

where c ul is a constant and 

w = ^l(l+ 1 ). 


Moreover, if the wave functions ip are to be normalized to unity we 
musttake / 4 tt Uf (i+*)! U 


CuI \2l+l) 


In the notation of the previous sections we have 

U — pF cos 6 


and 


W= / <l>\ u) Utf>W dr = 2ir[iFc ui c uj J PttoPfirir dp. 


Now it can be shown§ that the integral on the right vanishes unless 
j = i±l, and that 


t Cf. P. J. W. Debye, Polare Molekeln , chap, ix (Leipzig, 1929). 
j Cf. Chap. II, § 14, above. § Cf. Appendix. 
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Thus (l\U\l) — 0 and we must use equation (26) to find the perturba¬ 
tion in W t ; the series on the right-hand side of (26) reduces simply to 
two terms to give 


w = w 1 I ffl^+1) 2 


ft 2 

m+,)+ x -V ('+“)('-») 

(Z+l+w)(/+l— u) \ 

21 

. 2 (j(«-l)(2i+l) 

l)(2Z-J-3)j 


where we have written K = 2lfiF/Ii 2 . In particular the perturbation 
of the lowest energy level (u — l — 0) is 


W = - 


W K 2 

2 /* 6 


These results have been derived by Lennard-Jones using the determi- 
nantal method.t This method is also applicable in the case of large 
electric fields when K can no longer be assumed to be small. 


18.6. Wigner's perturbation method 

It often happens that the infinite series in equation (25) diverges in 
cases in which the lowest energy value itself is finite. In such cases 
the Rayleigh-Schrodinger method is, of course, inapplicable. WignerJ 
has given an approximate formula for the lowest energy which can be 
proved to converge at least in certain simple cases. Suppose we wish 
to determine the energy of the lowest state of the system whose wave 
equation is (H^-XH^ = W<f>, (41) 

when the energy values W n and the wave-functions ip n (n — 1,2,...) of 
the equation with A = 0 are known. Assume a solution of (41) of the 

fOTm * = (0 1 +A j I«,-A,)(l+A 2 1 arf)-' (42) 

for the lowest state. Using the result 

Hit, =1 amm 

i 

it is easily seen that for the wave function (42) 

J >p*H l ifi dr 

(1 |i/11)+A 2 {(11^I j)a,+(j11 7 11 )«*}+A 2 2 2 U\U\k)a k af 
____ 7 — 2 ___ j — 2 k—2 _ 

j 

t J. E. Lennard-Jones, loc. cit., § 6. 

J E. Wigner, Math . u. naturw. Anz. der ungar. Ahad . d, Wiss . 53, 477 (1935). 

5072 


n 
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Similarly it is easily shown that 

2 in;-»><*«, 

j 

Adding the two integrals and writing 

Ej^Wj+XUmj), (43) 

we obtain for the energy of the perturbed system 
W - f </,*(// 0 +A//,)</. dr = 

2 {(/? i -^0«r^+(i|//JiKH'(i|// 1 |i)<)+A 2 2 umafo k 

W i >2 7 2 A* 2 

Al+A -- i+a*"2«F«7 ' ' 

* 2 (44) 

Neglecting A 3 and higher powers of A w T e obtain 

(l-l-A 2 2 afaMW-EJ — A 2 2 {(E J -E,)afa . J +(\|//J 

\ 7-2 7 7 

Differentiating this equation with respect to and setting to 

zero we obtain the relation 

* k (W-K\) - (E'-E^-Hk I//J1), 

which may be solved to give 

a ___ (^l^ill) (45) 

Substituting from equation (45) into equation (44) and neglecting 
powers of A higher than the third, we obtain the equation 

+ A 2 - w=W)(w=w~ (46) 

for the determination of the energy W. 

The value of W obtained from this equation is always too high. 
Continuing this process Wigner obtains the equation 


W—E l = 2 T j+1 (W), 


where 


rp (W\ _ (\ W-»1 V V 0 1^ 1 1^1 1/ X 2) ,,, (M-1 ffi l 1/b’) (1Q\ 

m( )““ () Z’"Z ■ ( } 

rl W 

This equation for W can be solved by successive approximations. The 
series (48) converges in many cases in which the Rayleigh-Schrodinger 
perturbation theory leads to divergent results. It can converge even 
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in the case of a continuous spectrum which has been made discrete by 
imposing a finite boundary on the problem (as in the bounded linear 
oscillator § 11.2, above). The series (4K) was first obtained by Brillouin,t 
who obtained it by an intuitive consideration of the equations of the 
Schrodinger method. 

f L. Brillouin, Jouni. </. Phyx. 4, 1 (1933). 
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EFFECT OF A MAGNETIC FIELD AND THE 
ELECTRONIC SPIN 


19. Effect of a magnetic field on the energy levels of atoms 
19.1. Effect of a magnetic field 

In this chapter we shall consider the effect on a hydrogen or alkali atom 
of a magnetic field. It will be shown that the observed phenomena 
cannot be explained by the assumption of the point electron, and that 
we have to introduce the assumption of the ‘spinning electron’. This 
assumption amounts to the following: the electron, besides its positional 
coordinates, has a fourth degree of freedom; a quantum number cr z is 
introduced to describe this fourth degree of freedom. In the absence 
of a magnetic field the energy levels are unaltered by the spin, but in 
a magnetic field 11 the spin introduces an additional term into the 



All energy levels are thus split. In addition the spin makes an addition 
to the resolved part of the angular momentum of the system along the 
magnetic field equal to 

In this chapter these hypotheses will be introduced in order to explain 
the experimental facts. In Chapter XI it will be shown that, in order 
to obtain a consistent quantal theory of the electron obeying the 
principle of relativity, it was necessary to assume that the electron 
had these properties. This striking advance in the theory of the electron 
was due to Dirac (1928). In this theory the properties of the spin and 
the relativistic correction to the earlier non-relativistic theory are 
intimately connected, and it is no longer true to say that the spin does 
not affect the levels except in a magnetic field. 

The effect of a magnetic field on the energy levels of atoms can be 
investigated by the following methods: 

(a) Experiments of the type carried out originally by Stern and 
Gerlacht (1924). In these a beam of atoms is passed through an in¬ 
homogeneous magnetic field. Suppose the ground state of the atom is 


f For further details of this method, see any book on magnetism, e.g. F. C. Stoner, 
Magnetism and Matter (London, 1934). 
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split by a field H in the ^-direction into a number of states, with energies 
differing from the original state by 

fxHu , 

where u is a quantum number having a number of discrete values. 
Then the force on an atom in the state u is 

m 


The beam is thus split by an inhomogeneous field. The number of 
beams into which it is split gives the number of possible values of u, 
and the magnitude of the splitting gives the magnitude of n'dHjdz. 

(b) The Zeeman effect. Emission or absorption lines in the line spectra 
of elements are frequently split by a magnetic field into a number of 
components. On the quantal interpretation this means that the energy 
levels are split. Unlike the Stern-Gerlach experiment, the Zeeman 
effect gives information about the excited as well as the ground state. 

(c) Paramagnetic salts. The susceptibility of a salt containing N 
magnetic ions or atoms per unit volume is, if interaction between them 


can be neglected, 


N V V 

v JcT Z 


7 . 2 , 


where the summation is over the v states of the ion. /x and u refer of 
course to the ground state of the ion. For the derivation of this formula, 
cf. Stoner, loc. cit. 

It will first be necessary to evaluate the angular momentum of an 
electron in a hydrogen-like atom. 


19.2. The orbital angular momentum 

In this section it will be shown that if an electron or other particle 
is described by a wave function 

#*,S hz) - R(r)®(0)e<*+/y/( 2tt), (1) 

where B(r), Q(6) are any normalized functions of r and 6 , the com¬ 
ponent of the angular momentum along the axis of spherical polar 
coordinates is uh . 

In Newtonian mechanics the components of the angular momentum 
M xi M y , M z , are given by 

M x = yp z -zpv< 

My = Zp x — Xp z , ( 2 ) 

M z = xp v -yp x . 
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Let us take the ^-direction as the direction of spherical polar coordi¬ 
nates. Then we can show in an elementary way that the mean value 
M z of M z is uh , for, by (2) 




Transferring to polar coordinates this gives 

dip 


ttf h 

ill 

~ i 


r f </>* TJ <lr, 

i J d(j) 


and if ip is of the form (1) this reduces to 

M s — uh. 

To show that M z for a particle in the state corresponding to the 
wave function (1) actually has the value uh, we must make use of the 
transformation theory of §16. We need a series of transformation 
functions g M J(X,y,z)\ then 

j / ?/> y, ~) *j 2 (3) 

will give the probability that M z has the value M' a . These transformation 
functions are defined by the equation 

h 


h (x ° g M:<) 0; 

i \ by y dx) 


in polar coordinates this gives 

h by 
i dcp 

The normalized solution is 


-M' B g - 0. 


g = e^+MPn), 

where M' z — u'h, 

and the condition that g must be single-valued shows that u must be 
integral. 

Making use of (1), it will be seen that (3) vanishes unless u = u\ 
Thus if an electron is in a state corresponding to the wave function (1), 
the angular momentum has the value uh. 


19.3. The total angular momentum 

In this section it will be shown that if an electron is in a state with 
wave-function 


R(r)(r)(8)e i '"t‘/J(2TT) 
the total angular momentum is 


( 4 ) 


<J{l(l-\-l)}h. 
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The total angular momentum M is given in Newtonian mechanics by 
M* = Mi+Ml+M*. 

In quantum mechanics, J/;. is represented by the operator 
-h*Li- Z *)(y±- Z ±) 

\ & «W\ & W 

>J o b 2 . 9 # 2 0 C ) 2 C) b\ 

— — n* hr—-4-z 2 -—2yz - y - z I. 

\ bz 1 by 2 ' bybz J by bz) 

M 2 is therefore represented by the sum of three such terms. We have 
to transform to spherical polar coordinates. This can be done as follows: 
it is easily verified that 


M 2 - h 2 


-^+(Z "£) , + Z«£ 

\r »/ « ' it « 


// 2 [—r 2 V 2 + ( r grad) 2 +( r grad) ] 


- A* 


- h 2 


— r 2 V 2 +r 2 


— + 2r— 


1 r) 
sin 0 bO 


(sin 6 C 

\ & 


, 1 A 

,sin 2 0 b<f > 2 J 


The transformation function <f>) is thus given by 

1 bl. n by\ 1 b 2 a , J/' 2 

With the boundary conditions that y must be single-valued and bounded 
the solutions are />;f(cos 0)e l,t '* 

with M’ 2 ih 2 i(i+ 1), (5) 

and l a positive integer. It follows as in the last section that, for an 
electron in the state with wave function (4), M 2 (the square of the total 
angular momentum) has the value 1(1+l)h 2 . 


19.4. Energy levels in a magnetic field 

We shall now calculate the energy levels of an electron in an atom 
when subjected to a magnetic field, without taking account of the spin. 
The Schrodinger equation is (cf. p. 40) 

W + ^(Agrad0) + ^(» V-VW - 0. 

The second term is to be treated as a ‘perturbation’, according to the 
methods of the perturbation theory. 
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For the magnetic field we take 

H x = 0, H y = 0, H Z = H. 

This has the vector potential 

A x = —\Hy, A y = \Hx, A z — 0. 

Our perturbing term therefore becomes 

2 me \ dy * dx) 

or, introducing polar coordinates with the z-axis as axis of coordinates, 

v = aw*. 

with w = eU/2mc. 

Since the wave functions of the electrons are of the form 

J?(r)0(fl)e^/V( 2 tt), 

it is clear that all matrix elements of U vanish except the diagonal 
ones, and that 27T 

(u'\U\u") = ihw-- f 

2tt J dep 

o 

A state of azimuthal quantum number l is thus split into 2Z-f 1 states 
with energies displaced by 

■z—Hu (*= —1),— J). (6) 

zme 

This result was also obtained on the Bohr theory. It is easily verified 
on the classical theory that an electron revolving in an orbit with 
angular momentum I has magnetic moment el/2mc. If I is equal to 
Ifi, the magnetic moment is lefi/2mc. If we now assume that the resolved 
part of the angular momentum along the field is also quantized and 
equal to uli (u = — l Z— 1, l), then it follows that the resolved part of 
the magnetic moment is ueh/2mc . Formula (6) for the energy change 
follows. 

A magnetic moment equal to eh/2 me is called a Bohr magneton and 
denoted by ix B . 

We see therefore that 5-states (l = 0) should not be split by a 
magnetic field, apart from the electronic spin to be considered below; 
^-states are split into three states separated by an energy interval fx B H . 

If an atom has two states with energies W v W z , transitions between 
which give rise to a line of the emission spectrum of frequency v 0 given by 

hv 0 - W 2 -W v 
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then according to the above theory the frequency of the lino in a field 
H will be given by y 0 -j-Av, where 


Av: 


eH 

irrmc' 


A u. 


(7) 


A selection rule proved in Chapter X shows that Aw = 0 or ±1. Thus 
any lines (except s-s transitions which are actually forbidden) should 
split into three lines separated by frequency intervals eiT/^mc. 

Splitting of this type is known as the Normal Zeeman Effect; it is 
actually shown only for singlet terms, e.g. parhelium. The formula for 
Av does not contain h , and should thus be derivable from the classical 
theory of a vibrating electron. The proof of this is instructive, and is 
given below. 

Suppose that an electron is held in position by a restoring force —pr 
when it is displaced a distance r. Then, in the absence of a field, it 
will vibrate in any direction with frequency v given by 


„ = /£. 

27 t\j m 

Suppose now a magnetic field II is introduced in the ^-direction; the 
electron can still vibrate in this direction with its frequency unaltered, 
but the vibrations perpendicular to the field w r ill clearly be affected. 
The normal modes of the vibration are thus circular orbits in a plane 
perpendicular to the field. If r is the radius of such an orbit, we have 
for the components of force on the electron 
restoring force —pr, 
centrifugal force 
force due to field i Heojrjc , 
where to is the angular velocity (= 2m>). Thus 

p — mu > 2 = ^Heco/Cf 

giving approximately oj = J 

Formula (6) for the frequencies follows, as does information about the 
polarization of the three components (circular polarization of the outer 
components and disappearance of central component if the emitting gas 
is viewed parallel to the field, linear polarization of all three if the gas 
is viewed perpendicular to it). 


20. The electronic spin 

Many experimental facts have made necessary the introduction of 
the assumption of the electronic spin. These include the gyromagnetic 
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effect, the anomalous Zeeman effect, the results of the Stern-Gerlach 
experiment, and the existence of doublets in the spectra of the alkalis. 
In general it may be said that the number of states of the single-electron 
system (e.g. alkali, silver, or hydrogen atoms) is twice as great as it 
would be without spin. This is especially striking in the case of the 
ground state, which is an s-state (l — 0) and should therefore be non¬ 
degenerate. The original Stern-Gerlach experiment showed, however, 
a splitting of the ground state of silver into two, with a separation given 
by (eft///2mc). On these grounds alone a spin must be introduced. 

We may obtain a formula for the component of angular momentum 
as follows. For electrons without spin we have deduced from the 
properties of the Legendre polynomial cos0) the following properties 
of the orbital angular momentum: 

(a) The total angular momentum is y/{l(l +1 )}//•. 

(b) The component of angular momentum along the axis of coordi¬ 
nates has the value flu where u has (2Z+1) integral values. 

Since experiment shows a twofold splitting of the energy states due 
to the spin, then if s is the quantum number giving the angular 
momentum of the spin, it is reasonable to assume that (2s-}-1) -- 2 
and thus that s = J. Hence it follows that 

(a) The total angular momentum of a spinning electron is 

vw*+ip-~ y 3*. 

(b) The component of angular momentum in the direction of the 

field is ti s where ^ ^ 

From the experimental point of view we may determine the com¬ 
ponent of angular momentum in the direction of a magnetic field II 
by means of the gyromagnetic effect. This measures the change in the 
angular momentum of a bar of magnetic material when the direction 
of magnetization is reversed; the bar thereby receives a small impulse 
causing it to rotate about the field. If the magnetism is due to orbital 
motion and not to spins, we have for the ratio 
angular momentum __ 2 me 
magnetic moment e 
For spins the ratio is halved. 

The first experimental measurements of the gyromagnetic effect were 
carried out with ferromagnetic materials; we have here no a priori 
evidence as to whether the magnetism is due to orbital or spin motion. 
The observed values of the ratio ( |. 2mcje) show that the magnetism is 
due to spin. 
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20.1 . Doublets of the alkali atoms and anomalous Zeeman effect 

Owing to the spin , all states of the alkali-like atoms except the estates 
are doublets; the s-states in the absence of a magnetic field are not split, 
and so have a twofold degeneracy. This can be seen in a general way 
as follows: the electron, since it is moving in an electric field e/r 2 , is 
acted on by a magnetic field of order ev/cr 2 . In this field the energy of 

the electronic spin is of order , 

1 eh ev 

2m c cr 2 ’ 

which, if r is of the order of the radius of the hydrogen atom (/i 2 /me 2 ), 
gives a separation of the order 

~ X total energy, 


or about one part in 10,000 of the total energy. In s-states, however, 
there is no axis of symmetry for the motion and hence no change of 
energy due to the spin. 

Calculations of the doublet separation can be made (a) on the basis 
of the Pauli-Darwin theory of the next section; (6) on the basis of the 
Dirac theory of the electron (Chap. XI). The latter is, of course, the 
more complete theory and should give a correct result. 

The effect of the spin on the magnetic moment of an atom and hence 
on the Zeeman effect can also be worked out on the 
basis of either of the above theories, for a single 
electron moving in a central force. We give below 
a more elementary derivation due to Lande,*)* which 
is applicable also in a many-electron atom, where ~ 
several eleotrons may combine together to give a total 
spin angular momentum 8. Let also L be the total 
angular momentum due to orbital motion, and J the 
resultant angular momentum of the whole atom. 

Then the vectors 8, L f and J will form a triangle as 
shown in Fig. 15. The component /x of the magnetic moment along J 

will be given by e 

a - (2S cos B+L cos a), 

2mc 



since the spin gives two Bohr magnetons for each quantum h of angular 
momentum. We thus have 


t 1 


2 me 


Jg, 


I A. Lando, ZeMs.J. Phyaik, 19, 112 (1923). 
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where 


<7 = 



8 cos/ft 

~r 


The quantity g is called the Lande {/-factor. 


Since 


cos/? = 


2 SJ 9 


n , Tl , S 2 +J 2 -L*] 

we find 7-1-4 --——- . 

J [ 2J 2 J 

Putting in the quantum mechanical value for the angular momentum 
in terms of the quantum numbers, we find 

*(a+l)+j(j+ 1 ) — ?(/+ 1) 1 /g\ 

2j(i+l) J’ 

If the atom is now placed in a magnetic field //, the component of the 
angular momentum along the field will take up the following values 

j>j — 1 ix/i, 



giving 2j-\-l terms if J is integral or 2j if j is half an odd integer. In 
either case the energy interval between two adjacent terms is 

AW = < ~—g- (9) 


21. Wave functions for the spin 

Wave functions describing the spin may be introduced in the follow¬ 
ing way. We take an arbitrary direction in space, for instance the 
2 -axis of Cartesian coordinates. If a magnetic field H is applied in 
this direction, an electron in an atom will be found to have changed 
its energy by where \i B is the Bohr magneton (eh/2wc). Which 

value the energy should have is the only information that we may hope 
to obtain about the spin; we therefore introduce a variable <j z , which 
can take only the values ± 1 so that the energy of the electron in the 
field is n / in 


\ha z is then the component of angular momentum along the field. The 
complete wave function of an electron will then be 


0(^,2/, 2; O z ) ( d z = ±1). 
The interpretation of this wave function is that 

I '!'(*,y,*-, +i)l 2 rfr 


is the chance that the electron described by the wave-function $ is in 
the volume element dr and that its spin would give an energy —/z^I/ 
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where 


if a field were set up in the z-direction, with a similar interpretation 
- 1 . 

Since the interaction between the spin forces and the orbital motion 

of the electron is small, it is a good approximation to take ifj(x, ?/, z; aj 

in the form ,, v . x . 

'l’(v,y,z)x((r s ) (a s = ±1), 

where is the solution of a Sclirodinger wave equation for an 

electron without spin, and x( a z) a function of the spin coordinates 
only. 

In a magnetic field along the z-axis, the spin can have tw T o stationary 

states. If the energy is —fji n Ji we shall denote the spin wave function 

by , v 

xMz)> 

where X(x (\) - 1, Xa (-i) = 0. 

If the spin energy is //, we denote the corresponding wave function 
by 

where X^ 1 ) = 0, xM -1 ) = l - 

Problems involving the spin are of two types: 

(a) Those involving the symmetry of the wave function in the many¬ 
body problem. These may be handled by the use of the wave- 
functions Xa , xp without the introduction of a- wave equation. 

(b) Problems involving the interaction of the spin with a magnetic 
field; for these we need to set up a wave equation. 

In the next two sections, therefore, we will show how to build up 
a wave equation for the electron with spin. This equation w r as first 
obtained, by somewhat different methods, by Parwinf and Pauli.J 
It is a non-relativistic and therefore only an approximation to Dirac’s 
relativistic equation (Chap. XI). It is nevertheless worth while to 
develop this approximate equation for the following reasons: no satis¬ 
factory relativistic equation has been given for the interaction of two 
electrons in an atom; also the more elementary theory still has to be 
applied to nuclear particles (e.g. the proton and heavier nuclei) which 
have mechanical and magnetic moments but certainly do not obey the 
Dirac equation. 

The extra term in the energy of an electron due to its spin should 
be, according to classical mechanics, 

ftiH) + I(|i[Ev]). (10) 

c 

f C. G. Darwin, Proc. Roy. Soc. A, 116, 227 (1927). 
t W. Pauli, Zeit8.f. Phymk , 43, 601 (1927). 
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Here fx denotes the magnetic moment of the electron, H the magnetic 
field and E the electric field in which the electron moves, and v its 
velocity. It will be noted that [Ev]/c is the apparent magnetic field 
due to the motion of the electron through the electric field E. 

Since (x (e/mc) S, where S is the angular momentum, (10) becomes 

£{(SH) + I(S[EV])}. (1» 

To set up the wave equation for an electron with spin, therefore, we 
should add (11) to the Hamiltonian function. Our first difficulty, how¬ 
ever, is that we do not yet know how S should operate on i/j. We have 
shown in previous chapters that the angular momentum due to orbital 

motion appears in the Hamiltonian as an operator x~ — etc., but 

we are now concerned with the internal degrees of freedom of the 
electron, and we do not wish to introduce coordinates to specify its 
orientation in space when in fact only the two values of <j z are 
observable. 

We shall therefore make use of the methods of matrix mechanics, 
here introduced for the first time. 

21.1. Matrix mechanics 

The matrix corresponding to any dynamical variable L(p,q) and any 
set of transformation functions g n (q) has already been defined. We write 

{n'\L\n") -= j g%(<l)L{p, q)<j u ~(q) dq, 

where p is, as usual, to be replaced by the operator —ifod/dq. The 
aggregate of numbers (n'\L\n") for all values of* n\ n" is said to form a 
matrix, which may be written thus (if n takes the values 1, 2, 3...) 

(l\L\l) (l\L\2) 

(2\L\l) (2\L\2) 


As proved in Chapter I, §7, the matrix elements have the Hermitian 
property {n'\L\n") = (n"\L\n')*. 

The diagonal elements (ft/[Z|tt') are real. 

An important property of these matrices is that, if L(p,q), M(p,q) are 
any two dynamical variables, then 

(ri\LM\n”) = 2 (n'\L\n m )(n"'\M\n'y 

n m 



(12) 
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This is easily proved as follows: the right-hand side of (12) may be 
written 

d( l d( l'' ( 1 ^) 

But since the functions g n (c /) form a complete set of orthogonal 
functions, we can expand any function F(q) in terms of them; thus 

F(q) -- J,A n g n (q), 

n 

where A " ~~ j F (<i)V?M) <h- 

Thus F(q) ■, 2 f F(q')g*W)g„(q) <¥■ (14) 

Applying (14) to (13) we see that (13) is equal to 

/ g*Aq)FMg,Aq)cFh 

or, in other words, to the left-hand side of (12), which is what we set 
out to prove. 

It will be seen from (14) that 

I g*M')gn(g) = 0 , in 

= 00 (q = q') I 

and that the series behaves like the ‘8-function’, S(.r) introduced by 
Dirac. This has the property that 

J f(x')h{x-x') fix' = f(x). 

A further theorem of importance, and the one that we shall use in 
the development of the theory of the spin, is the following: suppose 
a system has the Schrodinger equation 

H(p,q)i/t(q)—Wifj(q) = 0 (10) 

from which the allowed energy values W are determined. Then if g n (q) 
is any transformation function and 

x (n') = J g*(q)<l>(q) dq, 

so that |x(«')l 2 is the probability that, for the system in the state \ft, 
the variable n has the value n', then (16) is equivalent to 

'Zin'\H\n") x in")-W x {n') = Q. 

41 * 


(17) 
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The proof follows the same lines as that given above, as follows: we 
have 

2 (n'\H\n") x (n") = 2// 9 *{Q)H(p > g)g n .(q)^.(q') 4 >(q')dqdq' 

n" n m J J 

= j&(q)H(pmq)dq 

= W jgl(qW(q)dq, 
which proves the desired theorem. 

As an example of this procedure, the function g n (q) may be the 
transformation functions relating the 2 -component uh of the angular 
momentum to the polar coordinate < 56 , so that 

g = e iv +M(2n). 


21 . 2 . Matrices for the spin moment 

In dealing with the electronic spin, the matrix form of the wave 
equation (17) is very convenient, and saves us from the necessity of 
introducing internal coordinates of the spherical polar type to describe 
the rotation of the electron. The coordinate n in this case will be the 
2 -component 8 Z of the angular momentum which is known to take the 
values only. 

It is convenient as before to introduce the quantity o z such that 
\o z his the component of the angular momentum in the 2 -direction, and 


2mc 

the energy in a magnetic field. Then 

a z — db 1 • 

Corresponding to our wave functions x( n )i we then describe the spin 
by the wave function already introduced 

xK)- 

Any quantity L in the Hamiltonian must thus be expressible as a 
matrix 


and Lx denotes 


W Z \L 10 , 

lW z \LK)x(°l)- 

Ol 

We thus have to find a matrix expression for the spin angular momen¬ 
tum S of the electron which occurs in ( 11 ). 

Consider the orbital angular momentum M of a particle in a state 
with quantum number l. We can then form the matrix of the angular 
momentum 

(u'\M x \u") = P?'(cos 0)6-™*(yj z - z^JP^cos 0)e<"'*sin 6 ddd*j>. 
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For the 2 -component the evaluation is elementary; 


{u'\M z \u") = ~ f Pf(coa6)e~ in '^~ Pj‘'(cos 6)e iu ’^ sin 0ddd<f> 
l J cm 1 


= h.u' (u r — u”) 

l ' 

= o (u f ^ u"). 


(18) 


and direct evaluation gives for the other elements 

(u+l\M x \u) - lhj{(l-u)(l+l+u)} = (u\M x \u+l)< | 

(u+l\M y \u) - -lM^{(l~u)(l+l+u)} - ~( u \M u \u+ 1 ) / %) 

with all the other components of the matrix vanishing. 

These forms are deduced for orbital motion, and thus for integral 
values of l and u. We shall, nevertheless, obtain matrices for the spin 
angular momentum S by setting / — J, u = S z -- ± l in these formulae. 
Introducing as before the matrix 

<7 — (cjl|ar|cr!) “ 2 $//i 


we obtain thus the following values: 

hw). Hi-j- <-> 

An alternative way of obtaining these matrices is by means of the 


relation 


M X M V - 


-M V M X = 


h 


,M Z , 


which is easily verified if M has the form of the operator ( 2 ). It is 
therefore true also as a matrix equation if M refers to the orbital motion. 
Let it be assumed that the same equation also applies to angular 
momentum of the spin. In that case, if we write as before M = \ha 
we have _ ___ ( 21 a) 

( 216 ) 

( 21 c) 


a x Qy- 

°z°x 


■ °v a x = 2 icT z , 
=* 2ia U> 

2icr r . 


-<r s <r« 


Since 


-S -,) 


the equations ( 216 ) and ( 21 c) show that the diagonal elements of a v 
and a z vanish. For the other elements we obtain from (216) 

(—1|°*|1) = *( l| a 2/|l) ( 22a ) 

and from (21a) 

(— 1 Jcr^.11 )(1 \o y \ 1) ( 1 Icr^l 1)(1 \<j x \ 1) - -2 i. 

H 


6072 


( 226 ) 
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Using the Hermitian relation, that 

(ik„l-i) = (-iKIi)*. 

and substituting from (22), we find 

l(-l|«.|l)l*= 1. 

This gives for a x o x = * j e ia , 

where ot is an arbitrary phase. The value of this phase will not affect 
any quantities that we may have to compare with experiment. a y may 
be found in a similar way. 

It should also be noted that the three matrices a x , a yi o z together 
with the unit matrix 

h = 

form the basis of an algebra of 2 X 2 matrices; for any arbitrary matrix 
of this type 

can be written as a linear combination 




(® = l(a+d)I 2 +Ua-d)v s +W+c)a x +li(b-c)^ 

of o x , Gy, G z , and / 2 . 

Furthermore, if we write x a an< i Xp i* 1 ma trix form we have 



In this way we may establish all of the relations 
ff *Xa=Xj3> °*Xj3 = Xa 

a v Xa = ix/l’ a vXp = —*X« • 

a zX a = Xa> *^X/5 = —X/5 > 


(23) 


21.3. Pauli's wave equation for an electron with spin 
The wave equation can now be written down. For a point electron, 
for a magnetic field H and electrostatic potential energy V, it is 

_ .(Agrad j) = Wf. (24) 

2m T T mci 

The additional terms in the energy due to spin are (eq. (11)) 


— (HS) + — (S[Ev]). 
me me 1 
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Replacing v by p/m and S by -kTia we have to add to the left-hand side 
of (24) 

K E grad lA])} • (25) 

if* is of course of the form a s ), and 

ori A = 2 

Cg — l, —1 

With this Hamiltonian we shall solve the following problem. 

Suppose that a beam of atoms in 2 S states is passed through an 
inhomogeneous magnetic field (Stern-Gerlacli experiment), the field 
being in the gg-plane such that H r = H cos 9, H s = II sin 6, The beam 
will be separated into two by the field. Let the wave functions of the 
spins in these two states be 

Xoc = A X«( a z) + li Xp( a c), 

x'p A 'x«.( a z)+ B 'Xfi( a =)- 

Our problem is to find A and B y and hence the numbers of atoms that 
would be found parallel and antiparallel to a field if it were suddenly 
switched to the ^-direction. 

The conditions that the two functions and ^ should be normal 
and orthogonal gives 

A*A'+B*B f = 0, 


We thus set 


AA*+BB* - A'A'*+B’B'* =■= 1 
A — cos A B = sin A e iv , 

A' = — sinAe*/ 4 , B ' = cosAe ?v . 


(26) 


Further, since x' a > Xp represent the two stationary states in the field H , 
the non-diagonal matrix element x*(Ho)xp must vanish. This gives 

(A*A’-B*B')H Z + (A*B r +B*A')H x = 0. 

Substituting from (26) we see that /x = v and 

H z sin 2A = H x cos 2A. 

Thus tan2A = H X IH Z = tan 6, 

and thus A = \0. 

Thus the relative numbers of electrons that will point parallel and 
antiparallel to a field suddenly imposed along the 2 -axis are 

cos 2 |0, sin 2 |0. 
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22. Evaluation of the doublet separation in alkali atoms and the 
anomalous Zeeman effect 

The Schrodinger equation for a point electron in the absence of a 

magnetic field is , 2 

*LVV+F<£= Wib. (27) 

2m 

The perturbing terms due to a magnetic field H along the z-axis and 
to the spin are 

u * “ (2S) 

For a given value of l, there are 2Z-f~ 1 solutions of (27) all with the 
same energy, and 2(2l~\-l) solutions when the spin coordinate is intro¬ 
duced. These will be split into 2(2Z-f-1) non-degenerate states by the 
perturbing term (28). Fortunately the 2(2Z+1) unperturbed wave 
functions can be split into pairs ifi a , ifj () such that the non-diagonal 
elements of V with respect to states in different pairs vanish. Each 
pair corresponds to a given value of the z-component j z of the total 
angular momentum; the wave functions are 

4> a = 

For these j z ~ u-\- 1. It will easily be verified that elements of the type 
Uz\U\jl) vanish if j' e ^ j" z . 

Neither i/* a nor ifj f) is the correct wave function for our state, unless 
a strong field is applied sufficiently great to break down the spin-orbit 
interaction and orient the spin along the z-axis instead of along L ; 
our wave functions will be of the form 

Ata+Bfa’ 

A'ta+B'h 

and the contribution w to the energies will be given (cf. p. 73) by the 
quadratic equation 

(U n —w)(U 22 —w) — |ty 2 . (29) 

The matrix elements can be evaluated as follows: E is given by 


eE x — 


xdV 
r dr ’ 


SlEgrad 


and thus 
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where M is the orbital angular momentum. Thus, if we write 

00 

dV ( 
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2 me 2 


si 


dr 


{.K(r)} 2 r 2 dr = P, 


we have for the matrix elements of 

eft 2 


2 m 2 c 2 


(o[Egrad]) 


with respect to the wave functions 

p / M ,y/{Z(Z-f 1)—w(w-fl)}\ 

\>y/{/(Z-fl)-W(^+l)} -(^+1) / 

making use of the relations (19) for the matrix elements of M. 
Equation (29) thus becomes 

{w—— Pu}{w—ixlIu-\-P(u-\-\)} = P 2 {l(l+l)—u(u+l)}. 

(30) 

If Ii vanishes, this gives 

w 2 -\ Pw - PH(l+1 ), 

whence w — PI or w = — P(Z+ 1 ). 

The doublet separation is thus 

(2Z+1)P. 

Since P is of order 1 /c 2 , we do not expect a non-relativistic theory 
to give a quantitatively correct theory; compare the value given by 
the Dirac equation, Chap. XI, §56.2. 

Turning now to the shift of the levels given by a magnetic field, we 
see at once from (30) that a strong field (Hp P) gives 
w — fxH(u+l) or [jlHu. 

In this case the spin-orbital coupling is broken down (Paschen-Back 
effect). For weak fields 


1 / dw \ 
fx\dH) 


h-»o 


(2u+l) 

, 1+1 
i+i 


W+P 
2 w+P 


Js) 


or 3z 


l 

i-Y 


The Lande (/-factor is thus given by 

' g=(l+mi+l), j = l+h 

g = j = l-b 

It will easily be verified that the usual factor reduces to this form when 
5 = 4 (i.e« for one electron). 



V 

THE TWO-BODY PROBLEM 


23. Introduction 

In this chapter we consider the problem of two particles, moving in 
one another's field and also in some external field. Problems in this 
category are: the hydrogen atom when the motion of the nucleus is 
taken into account, the helium atom and the hydrogen molecule, the 
scattering of a-partieles by helium nuclei. The most interesting features 
of the problem arise when the two particles are identical, i.e. two elec¬ 
trons or two a-particles; in this case we shall find that only half the mathe¬ 
matically possible solutions correspond to states that occur in nature. 

We shall first develop the theory without taking into account the 
electronic spin. We shall suppose that we have two interacting particles, 
with masses m v m 2 , and that their coordinates are 

Ti = (x v y 1 ,z 1 ), r 2 = (x 2 . y 2 , z 2 ). 

Then we have seen in § 8 that the state of the system will be defined by 
a wave function ip(r 1 r 2 ) of the coordinates of both particles, and that 
the interpretation of i/j is that 

!lA( r i r 2 )| 2 <? T idr 2 

is the probability that the first particle is at the point r x in the volume 
element dr v and at the same time the second particle is in the volume 
element dr 2 at the point r 2 . The equation satisfied by i/j is 



02 02 02 
Y72 V „_ i l 

2 dxy dyy dz\* 


and F(r 1 r 2 ) is the potential energy of the system when the first particle 
is at v x and the second particle at r 2 . 


23.1. The hydrogen atom with motion of the nucleus 
We denote by r v r 2 the coordinates of the nucleus and electron 
respectively, and by m v m % their masses. The potential energy of the 
system is —e 2 /r, where 

V = |r x —r 2 | = ^{{x 1 -x i ) 2 + {y 1 -y i f+(zi~z 2 ) 2 } 
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is the distance between the electron and the nucleus. To solve equation 
(1) we introduce the coordinates of the centre of gravity of the system 

R == (m 1 r 1 +m 2 r 2 )/(w 1 +m 2 ) 
and the distance between the particles 


The equation then transforms into 


where 


\7 2 -- d 2 | d 2 d 2 
R dX^dY 2 ^ dZ*' 


and M = m 1 +m 2 , m = m 1 m 2 /(m 1 +m 2 ). 

The equation is now separable. If we write 

i/j = 0i(R)^ 2 (r), 

0 is a solution of the equation if i/q, i/j 2 satisfy the equations 

,4 -Jr “ 23? 

ctf 2m ™ 


( 2 ) 


(3) 


The first is the equation for the wave function of a free particle of 
mass M\ the solutions represent the motion of the hydrogen atom as 
a whole. The second equation is the Schrodinger equation for the 
hydrogen atom; from it we deduce that the energy levels of the hydrogen 

atom are given by 4 , 

W = „ 

2h 2 n 2 ’ 

where m is not the mass m 2 of the electron but 

”•*/(' + S;)' 

The same result was obtained on the old quantum theory (§ 9). 


23.2. Rotational and vibrational states of the diatomic molecule 
Suppose the Schrodinger equation for the electrons in a diatomic 
molecule to be solved for the case when the distance between the two 
nuclei is r. The energy of the electrons in the lowest quantized state 
will be a function of r; we may denote it by W{r). If we add to this 
the potential energy of the nuclei in one another’s field, we get an 
expression for the total energy that the system would have if the 
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nuclei were held rigidly at rest at a distance r apart. We denote this 
energy by V(r). For stable molecules we expect a curve with a minimum, 
as shown in Fig. 16. 



In considering the rotational and vibra¬ 
tional states of a molecule, and hence the 
motion of the nuclei, it may be shownf 
that it is a good approximation to treat 
the nuclei according to SchrGdinger’s equa¬ 
tion as moving under a mutual interaction 
with potential energy V(r). In that case 
the Schrfidinger equation is formally the 
same as that discussed in the last section, 
if m v m 2 are the masses of the nuclei. 
The solution is in fact 


Fig. J 6. Energy of a molecule) 

a« a function of the inter-atomic <MR)<A 2 (r), 

diwtance R. 

where i/^R) describes the motion of the 
centre of gravity as before, and i/j 2 ( r) is of the form 


Pfc(cos 0)e iu< f > L(r)e~ im l h , 


W 


where L(r) satisfies 


1 d 

r 2 dr 



~{W-V(r)}-*L K +iy 


L== 


0 . 


The approximation usually made in describing the levels is that for 
the lower vibrational states only the form of V near the minimum need 
be considered, and so V may be written 

V = F 0 +lp(r-r 0 ) 2 . 


In a crude appr6ximation, assuming the amplitude of the vibrations to 
bo small compared with r 0 , we replace K(K+l)/r 2 by K(K+l)/r$. 
The energy values are then 

The first term gives the vibrational energy, the second the rotational 
energy. 

These energies are in general small compared with electronic energies, 
containing as they do the large nuclear mass m in the denominator 
either as m* or m. The interval between the rotational levels is in 
general less than that between the vibrational. 

f M. Bom and J. R. Oppenheimer, Ann, d. Phy&ik , 84, 457 (1927). 
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In the case where the diatomic molecule is built up of two atoms 
of the same kind, as for example in the hydrogen molecule H 2 , the 
wave functions tff(v 1 r 2 ) have certain symmetry characteristics which, as 
we shall see in the next section, assume great importance. Suppose we 
interchange the positions of the two particles. Then R, the position 
of the centre of gravity, is unchanged, as is also r , the distance between 
them. 8 , however, defined by 

CO8 0 = (Zx—Z^lr, 

is changed to tt—6, and <f> is changed to 7r+<^. It will be seen from the 
definitions of the spherical harmonics that Pj£(cos 6)e iu i and hence 
i/f(r 1 r 2 ) changes sign if K is odd but is unchanged if K is even. Thus 
0(r x r 2 ) = ifj(r 2 r x ) (K even), 

0( r i r 2 ) = —0(r 2 r x ) (K odd). 

24. Symmetry characteristics of the wave functions 

It will now be shown that the symmetry characteristic proved above 
for the wave functions describing the positions of the nuclei of a 
molecule is a general property of the wave functions describing non¬ 
degenerate states, with quantized energy levels, of any atomic system 
consisting of two identical particles. We denote by q x the group of 
coordinates (x v y l9 z v a 1 ) describing the position and spin orientation 
of the first particle, and by g 2 that of the second; if the particle has 
no spin, as for the a-particle for example, the cr-coordinate is omitted. 
Then we shall show that for all non-degenerate states either 

= <f>(M i). 

or ( 6 ) 

States of the first type are called symmetrical, those of the second type 
anti-symmetrical. 

The proof is elementary; 0 satisfies a wave equation of the typo 

WPi9v P*9a)—Wy>(9 i?a) = °- ( 7 ) 

The Hamiltonian H 0 is necessarily symmetrical; therefore by inter¬ 
changing the suffixes 1 and 2 we see that i/j(q 2 <h) is a solution of the 
same equation as q But the solution is non-degenerate; in other 
words, only one solution of the equation exists. Therefore 

where A is a numerical constant. Interchanging q 1 and q 2 on the right- 
hand side of this equation we obtain 

l A(?22i) = ^¥(?22l)- 
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Thus 


A 2 = 1 


and 


A = ±1. 


We have thus proved that all solutions satisfy one or other of the 
relations (6). 

We shall now show that if a system is originally in a state described 
by a wave function of given symmetry it will remain in a state of that 
symmetry; that is to say, transitions between symmetrical and anti- 
symmetrical states are impossible. Again the proof is elementary. 
Whatever the perturbing influence to which the system is subjected, 
it must be represented by a symmetrical Hamiltonian; thus if the 
perturbing field is such that the potential energy of an electron in it 
is V(q ), the perturbing terms to be added to the Hamiltonian are 
V{q 1 )+V(q t ). Thus the change with time of the wave function is 


determined by the equation 


ti d\fj 
i dt 


= H+, 


where 11 is a symmetrical Hamiltonian containing the perturbing 
functions. 


It follows that if 0 is initially symmetrical, the change 80 that will 
occur in a time interval 82 is symmetrical; and if 0 is initially anti- 
symmetrical, so is 80. The symmetry of 0 therefore remains unchanged. 

To proceed further, we must appeal to experiment and point out the 
remarkable fact that in nature, in the energy spectrum of a system of 
two particles, only the energy levels corresponding to states of one 
symmetry are observed, for instance in the optical spectrum. For 
electrons and also for protons, both of which have a spin of only 
the anti-symmetrical states are observed; for a-particles, which have no 
spin, only the symmetrical states. We shall show below the evidence 
for this statement; we shall discuss first some of its consequences. 

First of all it follows that, in describing pairs of particles in the free 
state as well as in the bound state, we must use a wave function of the 
required symmetry. If we did not, we could predict the formation of 
quantized states of the type which are not in fact observed. Suppose 
then we perform a measurement on one particle, the results of the 
measurement being describable by a wave function u(q ). Suppose we 
perform another measurement on another particle, and describe the 
results of this measurement by v(q). Then we might think that the 
correct wave function to describe the system consisting of the pair of 


particles would be 


u(qMq 2 ), 
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where q x is the coordinate of the first particle and q 2 of the second. 

We know, however, that this is not correct. If the particles are electrons 
the correct wave function must be 

^Mqz)--u(qM9i)- (&) 

If they are a-particles the sign must be reversed, giving 

u(qiMq2)+u(qMqi)- W 

At first sight this is rather surprising, because having made the 
appropriate measurement we know that particle 1 is in the state u. 
It is therefore necessary to introduce a slightly different interpretation 
of the wave function i/j(q 1 q 2 ) which describes the state of two identical 
particles. This interpretation is as follows: 

is equal to the probability that a particle has coordinates q v lying 
between q x and q' 1 -\-dq' 1 and the other particle has coordinates q 2 lying 
between q 2 and q 2 -\-dq 2 . No attempt is made to state which particle is 
at which position, and the suffixes 1, 2 refer to two volume elements 
in space, rather than to the coordinates of two particles. 

With this interpretation it is obvious that \^{q x q^)\ 2 mus t be a sym¬ 
metrical function of q x q 2 \ and thus that our system of two particles 
must be described either by the anti-symmetrical wave function (8) or 
by the corresponding symmetrical one (9). Thus, unless in our formula¬ 
tion of quantum mechanics we exclude either the symmetrical or the 
anti-symmetrical functions, the theory becomes ambiguous; we do not 
know whether to describe the result of our two measurements by (8) 
or by (9). It is therefore necessary to exclude either symmetrical or 
anti-symmetrical functions; to determine which should be excluded for 
particles of a given type we still have to appeal to experiment. 

Particles which must be described by symmetrical wave functions 
are said to obey the Einstein-Bose statistics, and particles which must 
be described by an anti-symmetrical wave function to obey the Fermi- 
Dirac statistics. 

For quantized systems, zero-order wave functions can be shown to 
be of the types (8) or (9) by applying perturbation theory as in § 17.3. 
Suppose, for example, that we consider two electrons in an atom. Then 
if the interaction between them is neglected or considered small, we 
can retain the concept of separate stationary states for the individual 
electrons. Suppose that these states have wave functions $ n (q ). Then 
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a state in which one electron is in the state n and another in the state 
n' can be described by either of the two wave functions 

0n(l)*M2) or 0„(2>M1), 

both of which give the same energy. The state is therefore degenerate. 
When, however, the interaction between the two electrons is taken 
into account, the energy will in general split; if this occurs, the correct 
wave functions of zero order will be 

;^2 Ww( 1 )lA»'( 2 ) i 1 )}• 

That this is the case follows from the proposition already proved, that 
solutions of the SchrOdinger equation for non-degenerate quantized 
energy levels must be symmetrical or anti-symmetrical. It also follows 
from the methods of § 17. If we set for the wave function 

^(l)^,(2)+S^2)^,(l) 

and introduce a perturbing energy F(12), we find that the shift w in 
the energy is given by 

(V a ~wf = \V b \*, ( 10 ) 

w = V a ±V b , 

V a = j ^(l)«.(2)F(12)^(l)^,(2)* yi r/y 2 , 

V„ = / ^*(l^U2)V(12)^2)^Jl)dq t dq r 

It will easily be seen that the two wave functions of zero order corre¬ 
sponding to these two values of w are those given by (8) and (9). 

24,1. Introduction of the spin coordinates 

The electron (and, as we shall see, the proton) has a spin of \h and 
can thus be described by a wave function ^(o^), where o z takes the 
values ^ I only. For ease of notation the suffix z will be dropped in 
this section. 

If we introduce a strong magnetic field H which will break down 
any coupling between spin and orbital motion, or between one spin and 
another, then the wave functions describing the two stationary states 
of each electron will be the functions already introduced, x<x( a )> Xpi* 7 )* 
defined by 

X<» = X/s(° r ) = 0 = !)» 

Xcc(°) = °» XjSM = 1 (a = —1). 


so that 
where 
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A pair of electrons will then have four stationary states, described by 
wave functions 

X«( G 1 )Xol\°2) 

xMxpi^+xMxfai) 

Xoc(°i)xii(°2)-xMx(}(°i) 

xp( a i)xfi{°*) 

These states have energies 2/z/7, 0, 0, —2/z/Z. The suffixes 1, 2 refer 
to the spin coordinates of the two particles; as already emphasized, no 
attempt must be made to associate either suffix with a particular 
particle. It will be seen that three of these functions are symmetrical 
and one anti-symmetrical. 

In the same way, if we are setting up wave functions for (e.g. nuclear) 
particles with spin sh, so that for each particle there are 2,9+1 stationary 
states, there will be (25+1) 2 states in all. Of these, in 2s +1 both 
particles will be in the same state, and the wave function will be 
symmetrical. Of the remaining 25(25+1) states, half will be symmetrical 
and half anti-symmetrical. There are thus 

5(25+1) anti-symmetrical states 
(5 +1) (25 +1) symmetrical states 

When the interaction between the spins is taken into account, it 
may no longer be a good approximation to express the spin wave 
functions as products or sums of products of the form (11). Neverthe¬ 
less, the property proved above must persist; if sfi is the angular 
momentum of each particle, the system of two particles will, for each 
orbital state, have (25+1) 2 states of the spin, of which (5+l)(25+l) 
will be described by symmetrical wave functions x s (a 1 cr 2 ) and $(25+1) 
by anti-symmetrical wave functions Xa( g i ^ a )* 

When therefore we come to consider the stationary states with 
quantized energy values of two particles with spin, the position is as 
follows: first of all, the wave functions must be found for two particles 
without spin; these will be of the form ^ s ( r i r 2 ), *iM r i r 2 ) 5 some, denoted 
by the suffix S , are symmetrical, others, denoted by the suffix A, anti- 
symmetrical. The complete wave function for the positional and spin 
coordinates may have either of the four forms N 


l As( r i r 2)Xs( <7 l U i.) 

'l l A r \ T i)xJpx a %) J 

| symmetrical, 

(13) 

'l’s( r i r z)XA( <J l °2) } 
lM r i r 2)Xs( a l ff 2) J 

i anti-symmetrical. 

(14) 
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The first two are symmetrical, the second two anti-symmetrical in the 
complete group of coordinates (r, a) that describes the state of the 
particle. Thus either the first two types, or the second two types, must 
be excluded. To find out which, we must make use of the observed 
properties. 

We may anticipate the findings of the next section and state here that 
for protons and electrons only anti-symmetrical functions may be used, 
for a-particles and in general for nuclei with an even number of heavy 
particles (nucleons) as constituents, only symmetrical functions. 

We have seen that transitions between states symmetrical and anti- 
symmetrical in the whole group of coordinates (ror) associated with a 
particle are absolutely forbidden. Transitions between states of the 
two types, for example 

M T i r z)XA^l a i)> , lr , 

(15) 

I M r I r 2)xs(°l°2)> 

while not absolutely forbidden, have nevertheless very low intensities. 
The matrix element of any perturbing function F(r, a x ; r 2 a 2 ) vanishes 
with respect to these two functions; it will be shown in Chapter X that 
this matrix element determines the transition probabilities. To obtain 
a finite transition probability, therefore, more accurate wave functions 
than the approximate ones (15) must be used; in other words, the 
effect on the wave functions of the spin-orbit interaction must be taken 
into account. In particular, for nuclei, since the magnetic moment 
of the nucleus is about 1,000 times less than for the electron, the 
probability of these transitions is very small indeed. 

24.2. Application to diatomic molecules 

The considerations of this section are valid only for homonuclear 
molecules such as H 2 in which the two nuclei are identical. If the 
element consists of a number of isotopes, they do not apply to the case 
where the two nuclei have different masses. 

We denote by r v r 2 the spatial coordinates of the two nuclei, and 
by a v cr 2 their spin coordinates. The orbital wave functions i/f(r t r 2 ) 
have already been found; they are of the form 

•Mr x r 2 ) = PZ(cos6)e iu *L(r), 

where (r,0,^) are the spherical polar coordinates of the vector -r 2 . 
It has already been shown that i/j k is anti-symmetrical if K is odd, 
symmetrical if K is even. 

If then we limit ourselves to anti-symmetrical wave functions, we see 
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that for a given orbital state the spin wave functions will give rise to 
(£-fl)(2s+l) states if K is odd, s(2s~{-l) if K is even. If we are limited 
to symmetrical wave functions, these numbers are reversed. Let us 
then consider transitions between two electronic states of a diatomic 
molecule; the energy levels corresponding to the two states will be, 
according to formula (5), 

W 1 +Kv 1 {n+\)+WK(K+ 1)/2J 1# 
Tf 2 +W^+i)+^'(X'+l)/2/ 2 , 

where I l9 I 2 are the moments of inertia of the two states. The usual 
selection rule for diatomic molecules is 

K-K'= 0 or ±1, 

but for homonuclear transitions K—K f — ±1 is excluded by the argu¬ 
ments of the last section. Therefore the rotational structure of a band 
is given by the formula 

hv - hv 0 +K(K +1 )[!* - ?•]. (16) 

Also, if a gas containing diatomic molecules is in thermal equilibrium, 
there will be differing numbers of molecules in states with odd and 
even K, the numbers being in the ratio s:($+l) or (s+l):$. Further, 
this ratio will not be altered when the molecule is excited by electronic 
bombardment or in any other way, since the probability of transition 
from odd to even states is so small. Thus, finally, we see that the 
intensities of the lines given by formula (16) should alternate, odd lines 
and even lines having intensities in the ratio sj(s-\- 1) or ( s-]~l)/s . 

Turning now to the results of observations on diatomic molecules, 
it is found that for the molecules He 2 , C 2 , and 0 2 the lines with odd K 
are not observed; therefore these nuclei must have no spin and obey 
Bose-Einstein statistics (symmetrical wave functions). Also for N 2 the 
odd states have half the intensity that the even states have. There¬ 
fore the nitrogen nucleus has a spin of unity (s = 1) and again obeys 
the Bose-Einstein statistics. 

It should be pointed out that these relations hold only when the 
molecule is formed from the common isotope, e.g. He 4 , C 12 , or 0 16 . No 
such alternation of intensity is expected or observed for molecules of the 
type 0 16 0 17 , for example (for which the rotational levels are slightly 
displaced on account of the changed moment of inertia).| 

It is significant that nuclei of even atomic weight always obey Bose- 

t Cf. R. T. Birge and A. S. King, Phya. Rev . 34 , 376 (1929). 
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Einstein statistics, as far as known. This is to be expected if they are 
built up of heavy particles (protons and neutrons) only. 

For hydrogen, lines corresponding both to odd and to even values 
of K occur; but lines for which K is odd have three times the intensity 
of the lines for which K is even. This shows that hydrogen nuclei 
(protons) obey Fermi-Dirac statistics (anti-symmetrical wave functions) 
and have a spin of one-half quantum (sh where s = £). 

Results for many other nuclei can be found in any book on nuclear 
physics, for example the review by Seaborg, Rev . Mod . Phys. 16 , 1 
(1944). 

The arguments given above show that transitions between the two 
forms of hydrogen (those with odd and even values of K) will proceed 
at a very slow rate. This means that if hydrogen gas in equilibrium 
at some temperature is heated or cooled to some new temperature, it 
will take a very long time to reach equilibrium. This gives rise to 
anomalies in the specific heat of hydrogen gas at low temperatures. 
The two forms of hydrogen (A" even, K odd) are called para- and ortho¬ 
hydrogen. The one form can only be changed into the other by a 
catalyst which dissociates the molecule and allows the atoms to recom¬ 
bine after being mixed up.f 

24 * 3 . The helium atom: general principles 

Considering first the orbital motion, and hence the wave functions 
for the spatial coordinates, we expect to have a series of symmetrical 
wave functions t/* s (r 1 r 2 ) with associated energies W s and anti-sym¬ 
metrical wave functions ip A (r x r 2 ) with energies W A . The ground state 
will be a symmetrical function. Including spin coordinates, the anti- 
symmetrical functions are of the type 

M r i r 2 )Xa{°i°2)> one spin state, j (n) 

^( r i r 2 )x«( or i (T 2 )» three spin states. / 

If electrons obey Fermi-Dirac statistics (as they do!) we expect there¬ 
fore to find in helium two sets of energy levels, between which optical 
transitions are extremely weak. One set of these, known as the par- 
helium levels, will have symmetrical orbital wave functions and will 
thus include the ground state; these states have zero spin moment 
(anti-parallel spins) and are not split by the spin. The other set (ortho¬ 
helium) with anti-symmetrical levels will be split into three states by 
the spin and thus give a triplet series. 

t Cf. for example, R. H. Fowler, Statistical Mechanics , 2nd ed. (Cambridge, 1936), 
p. 850. 
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The spin wave functions (17) are wave functions for stationary states 
only if a strong external magnetic field is applied, sufficiently great to 
break down the spin-orbital coupling and produce a Paschen-Back 
effect. The spin wave functions in the absence of an external field will 
be combinations of these wave functions. For a mathematical treat¬ 
ment of this case, a derivation of the triplet separation and of the 
Lande g- factor, cf. for example H. Bethe, Handb. d . Physik , 24/1, 
p. 393 (1933). 

Methods of obtaining the orbital or spatial wave functions are given 
in Chapter VI. 

24.4. Collisions between identical particles 

The symmetry characteristics of the wave function affect the in¬ 
tensity of scattering when one particle collides with another of identical 
type, for example when a-partieles are scattered by helium gas. The 
wave equation for the two particles 
(equation (2)) can as before be solved 
in the form 

0i(R )Mr)> 

where R is the coordinate of the 
centre of gravity and r is the vector 
joining the position of the two par¬ 
ticles. If we refer the system to 
moving axes with respect to which 
the centre of gravity is at rest, 
ip x ( R) is a constant; we thus consider two streams of particles impinging 
with equal and opposite velocities, as in Fig. 17. The function i/r 2 (r) 
satisfies the differential equation 

+*£{W-V{r)W t = 0, 

where V{r) is the interaction energy and m* is the ‘reduced mass’, in 
this case half the mass of either particle. As will be shown in Chapter 
IX, a solution i/j(r x r 2 ) can be obtained which for large r has the form 

^e***+— f(d). (18) 

r 

With this unsymmetrical wave function we should interpret |/(0)| 2 as 
the number per unit solid angle scattered from beam A and |/(7r—0)| 2 
as the number scattered in the same direction from beam B. The total 
intensity of scattering would thus be 

l/WI 2 +l/(^-^)l 2 - 



Fig. 17. Collision between identical 
particles. 


(19) 
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The function (18), however, does not have the desired symmetry. 
Suppose, for instance, that we are dealing with a-particles; for these 
the spin is zero and the wave function must be symmetrical. Since 
an interchange of the coordinates of the two particles changes z into 
—-s and 9 into ir-d, the correct wave function is 

pikr 

~ 2 cosfo +—[/( 0 )+/(w— 0 )], 

Y 

and the total intensity of scattering in the direction 9 is, instead of (19), 

\m+f(ir-0)\\ ( 20 ) 

There is no possibility, theoretical or experimental, of distinguishing 
between knocked-on and scattered particles; ( 20 ) gives the total 
number. 

It is particularly to be observed that when 9 = 90° formula ( 20 ) 
gives just twice as many scattered particles as the unsymmetrical wave 
function (19). 

If the velocity of the particles is not too high, the electrostatic 
repulsion between the particles alone is responsible for the scattering. 
Under these conditions, as shown in Chapter IX, |/(#)| 2 reduces to the 
form given by classical mechanics, namely 



and thus the quantum mechanical formula ( 20 ) gives just twice the 
classical value at 90°. 

In order to verify this prediction of quantum mechanics that for 
identical particles without spin the scattering would be twice the 
classical, experiments were carried out by Chadwick,f who bombarded 
helium with a-particles; it is then to be expected that twice the classical 
scattering will occur at 45° (since in the actual system the centre of 
gravity is not at rest). In Fig. 18 a is shown the ratio of the observed 
number of scattered particles between 40° and 50° to the number to 
be expected classically. It will be seen that at low velocities this ratio 
tends to 2 ; at higher velocities the scattering is determined by the 
structure of the a-particle. 

For Coulomb scattering the function f(9) has the formf 

-—5 cosec 2 19 exp — i - 17 — log ( 1 —cos 9) +const. 

2mv 2 L 

The phase factor, which contains Planck’s constant h and is therefore 
t J. Chadwick, Proc. Roy. Soc. A, 128, 114 (1930). 

} Cf, Mott and Massey, Theory of Atomic Collisions (Oxford, 1933), p. 35. 
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a consequence typically of quantum mechanics, does not affect the 
collision between particles that are not identical. As is evident from 



Fid. 18a. Ratio of observed scattering of a-partiolos in helium to classical Coulomb 
scattering. K is the energy of the particle, and hJ 0 that of the original particle from a 
polonium source. 



Fig. 186. Theoretical value of ratio shown in Fig. 18 a. Curve 1, v — 8x 10 8 , 
curve 2, v = 4 x 10 8 , curve 3, v — 2x 10 8 cm./sec. 

formula (20), however, it does do so in the case of identical particles; 
the ratio of the scattering given by (20) and by the (classical) formula 
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(19) is plotted in Fig. 186 for values of the velocity equal to 2x 10 8 , 
4 X 10 8 and 8 X 10 8 cm. per sec. The smaller the velocity, the more rapid 
are the oscillations. 

In order to verify the existence of these peaks a detailed statistical 
investigation was made of the scattering of a-particles by helium in an 
expansion chamber by Blackett and Champion.*}* The results fully 
confirmed the existence of the minimum round about 35° for v ~ 2 X 10 8 
cm./sec. 

If the colliding particles have spin (protons or electrons) the treat¬ 
ment is rather more complicated. In this chapter a discussion will bo 
given only of the case where the particles have spin \ (angular momen¬ 
tum \ti) and obey Fermi-Dirac statistics. 

If the particles have opposite spins, they may be treated as distin¬ 
guishable; the number of particles scattered in the direction 6 will be 
given by (19), namely 

l/(0)l 2 +L/V—0)| 2 . (2i) 

If they have the same spin, the wave function in the spatial coordinates 
must be anti-symmetrical; the number of particles scattered will there- 

fore be l/W-/(»- 0 )|*. ( 22 ) 

Taking the mean of the two, the scattered intensity is 

mm z +\A*-o)\ 2 +m-fw-m' 
ie. ww+ffr-w+hm-f^-vn 

This formula may be obtained in a slightly different way. Suppose 
that a measurement is made which shows that one electron is in the 
neighbourhood of the point A, with wave function tf/ A ( r) and with spin 
wave function the second electron lias wave functions ^(r), xp( s )* 
Then the wave function describing the pair of particles is 

$A( r i)XoL{ S l)$B( r 2)Xp( S 2)~$A( r 2)X*( S 2)$Ii( r i)Xp( S l)- 

If the spins are in the same direction (oc ~ j8 ), then it follows that the 
spatial wave function is anti-symmetrical and that the scattering is 
given by (22). If they are in opposite directions it may be written 

M^(l)«Ai,(2)-^(2)^(l)}{ Xa (l)x/ J (2)+xM 1 )Xa(2)}+ 

+ i{^( 1 )^(2)+^(2)^(l)}{Xa(lM2)-X^l)Xa(2)}. 

f P. M. S. Blackett and F. C. Champion, Proc. Roy. Soc. A, 130, 380 (1931). 
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From this it follows that the scattering is given by 

i l/(^) +/(^—0) 1 2 +i 1/(0) —/(^r—0) 1 2 , 

which is identical with (21). 


24 . 5 . The Pauli principle 

The Pauli principle, as originally formulated, stated that no two 
electrons in an atom could have the same set of four quantum numbers. 

The Pauli principle follows from the more general statement that the 
wave function i/j( r 1 <r v r 2 cr 2 , r 3 o- 3 ...) of all the electrons in an atom, or 
in any other system, must be anti-symmetrical in the coordinates of any 
two electrons. When the interaction between the electrons is small 
enough for it to be possible to represent the orbital wave function by 
a product of functions of the coordinates of individual particles, the 
Pauli principle results. This may be seen as follows. 

It is simplest to consider an atom in a strong magnetic field suffi¬ 
cient to produce a Paschen-Back effect, so that there is no coupling 
between the spin and the orbital motion. Suppose that the n individual 
electrons are in orbital states a, b , c,..., n with wave functions <//„(r), 
\jj b ( r).... If the total spin along the field is sft, this means that \n-\-s 
spins are oriented along the field and In—s in the other direction. 
Thus there aro n\j(\n— «)!(^w+s)! ways in which a spin wave function 
Xoc( a ) or Xp( a ) can associated with an orbital wave function *A a (r). 
Let the series of functions 

>Pa( r )Xa( f T)> M r )Xh( a )> <Pc( r )Xc(«)’ - ( 23 «) 

represent one of these ways; the suffixes a, b , c, must when appended 
to x denote a or /?. Then, as long as none of these wave functions is 
the same, we can form an anti-symmetrical wave function in the form 
of the determinant 


•/'o( r l)Xa(° r l) <A«( r 2)Xa( ff i) • • • 

tb( r l)Xb(°l ) '/'b( r 2)Xb(< r 2) • • • • 


(236) 


If, however, any of the individual wave functions (23 a) are the same, 
(23 b) vanishes and no anti-symmetrical wave function is possible. This 
proves the Pauli principle. 

The function (236) is not the correct zero-order wave function of the 
system; to obtain this we must add together all of the n\j(\n— $)!(£n-f $)! 
functions of the type (236) which do not vanish, with coefficients which 
have to be determined. Methods by which this can be done are 
described in Chapter VI. 
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25. The helium atom: methods of obtaining wave functions 

In this section we discuss only the wave functions in the spatial 
coordinates; we have therefore to solve the SchrOdingcr equation 


vf<A+vi<A+^'{ b 


2 e 2 . 2 e 2 


F^iK 


0 . 


(24) 


As has been shown already, we expect to obtain a series of symmetrical 
wave functions i/f >s (r 1 r 2 ) with corresponding energy values, which with 
the introduction of the spin remain singlet terms, and a series of anti- 
symmetrical wave functions if* A (r 1 r 2 ), which become triplet terms. 


25.1. Perturbation method of obtaining energy levels 
In order to obtain approximate values for the energy levels it is 
convenient to consider both electrons moving in some field F(r); the 
SchrOdingcr equation 

Vy + 2 ^{W-V(r)W = 0 (25) 


is then solved, and characteristic energies W n and wave functions 
obtained; wave functions of zero order for the helium atom are then 
taken to be j 




with energy W n -\- W n >; and the remaining term in the potential energy 


m 77/ \ 2 e 2 2 e 2 

U(r 1 r 2 ) = —--— + r- 


■ V(r x )—F(r,), 


is treated as a perturbation and the change in the energy is calculated 
by the methods of Chapter III. In his original paper on the helium 
atom, Heisenbergf took for V(r) 


V(r) = 



v(r) — e 2 /r 0 
= e 2 /r 


(r < r 0 ) 
(r > r 0 ), 


where r 0 is some suitably chosen radius; clearly each electron will move 
in the field of a doubly charged nucleus for small r and that of a singly 
charged nucleus for large r. This form of V(r) is not, however, the best 
possible choice (cf. § 25.2). 

The change in the energy due to the perturbing term U (r x r 2 ) is then 
(we write ^(1) for 

i ff l^(l)«A„-(2)±^(2)0 n .(l)| 2 ^(r 1 r 2 )rfr 1 dr 2> 


t W. Heisenberg, Zeita.f . Physik , 39, 499 (1927). 
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which reduces to J nn -±K nn ., where 

*4m' == JJ (^1 ^ 2 )! 1 /v0 i” d T l ^ t 2 

and K m , = jj U(r x T 2 )^„( 1 )t/i*.(l)ip n (2)ifi*.(2)dr x dr t . 

The integral K is called the ‘exchange’ integral. The ground state 
of helium has the wave function of zero order and there is 

thus no splitting of the energy value when interaction between the 
electrons is taken into account; when, however, one electron is excited 
the pair of wave functions of zero order 

are possible, with energies to the first order 

Wo + w^+«4»±-^0ff 

The terms with the positive sign form the series of levels (singlets) which 
include the ground state. 

25.2. Choice, of the field V{r); the Hartree method of the self-consistent 
field 

It has been shown by Hartreef that the best field F(r) for the calcula¬ 
tion of the wave functions of zero order, at any rate for the ground 
state of helium, may be obtained as follows: if ifj(r) is the wave function 
for the ground state of an electron moving in the field F(r), and thus 
the solution of (25), V(r) is given by 

V(r) = -~+ J imi'fzpjdr'. ( 26 ) 

This field is termed the self-consistcnt field; it can only be obtained by 
a series of successive approximations, since if* cannot be calculated until 
an estimate of F is made, which must then be modified using the value 
of tfj obtained. 

It will be seen that F(r) is the potential energy of an electron in the 
field due to the nucleus and to a charge distribution (due to the other 
electron) of density —e|0(r)| 2 at any point at a distance r from the 
nucleus. 

The self-consistent equation is thus 

VN’+^{W-V(r)W = 0. 
t D. R. Hartree, Proc. Camb. Phil. Soc. 24, 89 (1928). 
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It will be noticed that IT or 2 IF is not directly related either to the ioniza¬ 
tion potential or to the total energy of the atoms. The total energy 
of both electrons (minus the work required to remove them both) is 

Jj ip*(l)ip*(2)Hip(l)ip(2) £fr 1 rfr 2 

= 2w-e* JJ 

It is important to realize that the Hartree approximation does not 
entirely neglect the influence of one electron on another; for instance, 
in helium the field V{r) behaves as — 2e 2 /r near the nucleus and as 
—e 2 /r at large distances, owing to the screening effect of the other 
electron. On the other hand, if one electron is on the right-hand side 
of the atom, the other electron is more likely to bo on the loft than on 
the right because of the repulsion between them. This fact is neglected 
in Hartree ? s approximation; to take it into account terms involving 
| r x —r 2 l must be introduced into the wave function. 

The proof that the self-consistent field gives the best approximation 
for the ground state requires the following lemma: 

Consider a Hamiltonian function H(p ... q ...) from which a Schro- 
dinger equation can be set up. Let \fj(q ...) be any function of the space 
coordinates q satisfying the normalizing condition 

J dq = l. (27) 

Then, if the integral W = J ip*IIift dq (28) 

is formed, the lowest energy value W Q corresponding to a characteristic 
solution of the SchrGdinger equation 




is equal to the lowest possible value of the integral (28) for all possible 
choices of t/t which obey (27). 

The proof is as follows: suppose we vary if* by adding to it a small 
arbitrary function S</r of q. Then since the integral (28) is a minimum, 
the change in its value will be zero to the first order. Thus 


J 8 dq+ j ip*H Sip dq = 0. 

By the Hermitian property (cf. Chap. I) the two integrals are equal; 

J Sip*Hip dq = 0. 


we thus obtain 


This is valid for all functions Si/j such that, by (27), 


J S^r*^r dq = 0. 
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Using the method of undetermined multipliers, we see this is the case 
for a certain choice of a parameter A if 

J = 0 

for all functions 80*. It follows that 

Hifj —A 0 — 0, 

which gives us just the Schrodinger equation. 

The other solutions of the Schrodinger equation also give minimum 
values of (28), 0 being varied in such a way as to keep the number of 
nodes or nodal surfaces in the wave function a constant. 

The self-consistent or Hartree solution can now be shown to be the 
one which gives the minimum energy, and hence the best approxima¬ 
tion to the wave functions when (in the case of an atom with two 
electrons) the wave function 0(r l5 r 2 ) is taken in the form of a simple 

■ ,Ioduct «r 1 )«rj. (29) 

For if we substitute (29) in (28) above and vary 0 by 80 we obtain 

/ 8>A(1)( f — o 

for any variations of i/j(r) which satisfy 

J 0(r) 80(r)rir = 0. 

Using as before the method of undetermined multipliers, we obtain the 
following differential equation to determine 0(r): 

jj #W(2) *.}*(!)—W) - 0. (30) 

The term in the curly brackets gives 

J 4,mmdr,= -—Vf+VM+A, 

where V(r) is defined by (26) and 



Equation (30) thus becomes 

VV + 5?{tt’-y(r)}«A = 0, 

where w = W—A. This is Hartree’s self-consistent equation for 0. 
It will be noted that the total energy of the two electrons is given by 
w == W—A. 
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25.3. The Fock equation 

If, in the approximation of zero order, the two electrons are not in 
the same state and thus if an excited state of the helium atom is under 
consideration, the wave functions derived by Hartrec’s method do not 
give the best possible approximation. If the wave functions concerned 
are <// 0 (r), Hartree’s method would consist in making the integral 

JJ 

a minimum. A better value for the energy will be obtained by using 
symmetrical or anti-symmetrical wave functions for the terms of ortho - 
and parlielium respectively and minimizing the integral 

w - JJ W(1)«(2)±#H2)«(^^ 

This was first pointed out by Fock,f and the resulting equations are 
called Fock*s equations. These equations are, as may easily be shown 
by the methods of the last paragraph, 

{£v* + 2 y + W-H n „-O nn (r )^ 0 = ±{U u +Cl n (r)}K 

(£ V2 + T +W ~ IlM ~ 0wir) }^ = 



G ik (r a )= ( T-^—MtWrJdTr 

J i r i — r 2l 

As in Hartree’s equations, a solution can only be obtained by a method 
of successive approximations. An attempt to solve the equations for 
the excited 3-states of helium has been given by L. P. Smith.J 

Further consideration of Hartree’s and Fock’s methods for atoms 
of more than two electrons is given in the next chapter. 

26. The deuteron 

A two-body problem of some importance in theoretical nuclear 
physics is that of determining the energy levels of the deuteron. The 
deuteron, consisting as it does of two elementary nuclear particles (or 

t V. Fock, Zeits.f. Physik. 61, 120 (1930). Soe also J. C. Slater, loo. cit., and Brillouin, 
Actualites Scientiftques, iv (Paris, 1934), who gives a detailed account of the two equations 
and of the relation between them. 

t L. P. Smith, Phys. Rev. 42, 176 (1932). 
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nucleons ), one proton and one neutron, plays in nuclear physios a role 
similar to that assumed by the hydrogen atom in atomic physics. 
Further, since any two-body problem can be integrated explicitly if 
the force between the two particles is a known function of their distance 
apart, calculations on the; deuteron are more suitable for quantitative 
comparison with the experimental results than any others in nuclear 
theory. 

In order to deal with the two kinds of nucleons it is necessary to 
introduce a convenient formalism; this was first achieved by Heisen¬ 
berg,f who introduced the isotopic spin variable t analogous to the spin 
variable s. Just as in the latter case s 1 represents a particle with 
a spin in the direction of the 2 -axis and s — — 1 a particle with spin 
in the opposite direction, l — 1 represents a neutron and l -- — 1 
represents a proton. In precisely the same way there an; isotopic spin 
operators t — (t x) t u , r z ) corresponding to the Pauli spin operators 
a — (<r r , a ir a z ) defined above. 

The wave function of the deuteron is thus a function not only of 
the space coordinates r (1) , r (2) of the two particles but also of the spin 
coordinates s (1) , a (2) and the isotopic spin coordinates / (1) , P\ Thus 

ifj = ifj{ r (2) , s< 2) , *»). (31) 

For the deuteron the wave function iff may be written as a product 

i/j = <f>{ r* 1 *, r (2) )y(,s* (1) , s (2) ; p)), (32) 

where <f> is a function of the space variables and y a function of the 
spin and isotopic spin variables. For nuclei containing more than two 
nucleons it is not possible to separate the space coordinates from the 
spin and isotopic spin variables in this way. 

To fulfil the Pauli exclusion principle i p must be an anti-symmetrical 
function. Thus either </> is symmetrical and x is anti-symmetrical or <f) is 
anti-symmetrical and x is symmetrical. In the ground state we would 
expect the former of these conditions to hold. If we denote the spin 
wave functions by y a , xp> as above, and the corresponding isotopic 
spin functions by x a > Xj8> fben for the triplet state we have 

X ( S S) = Xoti Xa a (Xa, Xp —Xpi XocJ> ( 33 ) 

and for the singlet state 

x( 1S ) = (XcdXp—XPiXjiX^Xpz+XPiXa*)- ( 34 ) 

Using the results (23) of Chapter IV we have 

= (Xpi’ + i Xf3 l ’ Xj’ 

t W. Heisenberg, Zeits.J. Physik, 77, 1 (1032). 
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and similarly for af^Xa» so that 

(c^.o^XoaXo, = (o^Xa.-^Xa.) = XkXocs ( 35 ) 

The t’s and x’s satisfy the same relations as the o-’s and x’s, i.e. 

TxXac = Xli, T v Xa = +%, TzXoc = Xoc 
r xXp = Xa> TyXfi = —^Xa. T *X/} = ~X/J 
SO that (^.^(Xa.Xft— XftXa.) = — : 3(Xa,-X/S,— XPi-Xot.)- ( 3? ) 

Now according to the Moller-Rosenfeld meson theory*)* of nuclear 
forces the interaction between two nucleons is given by 

U = (t>» . t (2 >)[(? 2 + (o ( 1 » . o®)^ 2 ] —’ (38) 

r l2 

where r 12 denotes the distance between the two particles, F and G are 
two constants with the dimension of electric charge, and A is a constant 
related to the mass, m, of the meson by the formula 



A = 


me 

T" 


(39) 


If M denotes the mass of a nuclear particle, then transforming the 

Schrttdinger equation to coordinates in which the centre of gravity is 

at rest, we obtain M 

V 2 'il, + ~(W-UW - 0 (40) 

1 \, 

as the wave equation for the deuteron. 

For the triplet state we have from equations (35) and (37) that 

I7«A=-3(* 12 +(7 2 )— 

r 

and similarly for the singlet state it can bo shown that 

Ut/, = ((? 2 -3jP 2 )—V- 

r 

Thus in both cases we may write for the state (l -- 0) of the deuteron 


Writing 


^ 2 

dr 2 r dr h 2 


\[ w + b -^=»■ 


we obtain the equation 


d P * + 



(41) 


t C. Mailer and L. Bosenfeld, D. Kgl. Danske Vidensk. Selskab, Mek.-fys. Medd. 17 , 
No. 8 (1940). 
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The differential equation (41) satisfied by the radial part of the wave 
function for the ground state of the deuteron has been studied in detail 
by A. H. Wilson.f The problem now is to determine a when b is known. 
Assuming a solution of equation (41) of the form 

O = pe -i*P, (42) 

so that d 2 <t>/dp 2 is equal to we have to determine the energy 

CO CO 

(° + ?)J a)0 * dp = ~ b J 
0 

CO 

Now J 0*0 dp = 

o 

so that we obtain the relation 

a== ~2(l+«)"*~*“’ 

To obtain the best value of a we minimize a with respect to a. In this 
way we obtain the parametric equations 


J p (!+<*)• 

[) 

6c* 3 


2 («-l) b = («+l) 

4(a-f-3) 9 a(a-{-3)’ 


giving the relation between a and b. Wilson compares this solution 
with that obtained mechanically by means of a differential analyser. 
The results are summarized in the first three columns of the table below. 


a 

Analyser 

b 

Wilson 

Hulthhi 

000 

-1*70 

— 2-00 

-1-68 

-0*25 

-2-73 

-2*81 

— 2-77 * 

-0-50 

-315 

-3-23 

- 3*20 

-0-75 

-3*48 

-3*54 

-3*52 

-1-00 

-3*76 

-3-83 

-3*82 


The second column gives the value of b , corresponding to the value 
of a on the left, as determined by the differential analyser. The third 
column was constructed by calculating b from equation (43) for the 
value of a necessary to give the prescribed value of a . It will be observed 
that these values of b lie close to each other except in the neighbour¬ 
hood of a = 0. Wilson also computed the relations between a and b 
arising from taking a wave function of the form 

0 = p(e~-\0Lp__ e -^p) 

t A. H. Wilson, Proc. Camb. Phil. Hoc. 34, 365 (1938). 
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containing two parameters a, /?; but he found there was no marked 
improvement on that obtained from the wave function (42). 

More recently Hulthenf has employed an approximate wave function 
of the form $ _ (44) 


where y is a parameter. The values of b resulting from this wave 
function are given in the final column of the table. The agreement with 
the exact solution obtained from the differential analyser is very close 
in this case (cf. Fig. 1 of a paper by Frohlich, Ramsey, and Sneddon ).% 
In calculating the binding energies of more complicated nuclei 
Hulthen’s wave function (44) is not much more accurate than Wilson’s 
approximation (42) and the latter is easy to handle analytically. By 
means of these calculations and a similar one for IP it is possible to 
calculate the constants F, G, m of the Moller-Rosenfeld theory,§ and 
to show that it leads to results in agreement with the observed angular 
dependence of the neutron-proton scattering.$ The value of the meson 
mass obtained in this way is about 220 electron masses, while cosmic- 
ray observations give a mean value of about 200 electron masses.|| 


t L. Hulthen, Arkiv f. mat. astr. o.fys . 28, No. 5 (1942). 

t H. Frohlich, W. Ramsey, anil T. N. Sneddon, Rep. lnt . Cony . on Fundamental 
Particle *, Cambridge (1946), p. 166. 

§ H. Frohlich, K. Huang, and T. N. Sneddon, Proc . Roy. Soc. A, 191, 61 (1947). 

|! W, F. 0. Swann, Report* Proy, Physic*, 10, 46 (1944-6). 



VI 

ATOMS WITH MANY ELECTRONS 

Introduction 

In the last chapter we discussed the quantum mechanics of monatomic 
systems next in order of complexity to the hydrogen atom, namely 
those in which two electrons move in the field of a heavy nucleus. 
We shall now consider some of the methods employed to extend the 
analysis to more complex monatomic systems, in which more than two 
planetary electrons revolve round a heavy nucleus. One of the most 
important methods of attack on this problem is omitted here because 
it requires a knowledge of the theory of groups. Accounts of the 
applications of the theory of groups to the discussion of complex atomic 
spectral should be consulted for details of this method—one of the 
most powerful for the derivation of general results. 

We shall be concerned here mainly with an account of the various 
methods of calculating the energy levels and wave functions of complex 
atoms. In many problems a knowledge of the wave functions of complex 
atoms is necessary for the determination of physical quantities such 
as the radial charge density; an account is given of methods of obtaining 
the wave functions numerically and of approximating to these wave 
functions by means of analytical expressions. 

27. The wave functions of many-electron systems 

27.1. The Schrodinger equation in atomic units 

In order to avoid the frequent repetition of the constant 2m./h 2 it is 
convenient to express the SchrOdinger equation in the atomic units 
first introduced by Hartree.J In this system of units the unit of length 
is chosen to be the radius of the lowest orbit in the Bohr theory of the 
hydrogen atom, r 0 — h 2 jme 2 . Taking the unit of charge to be the 
electronic charge, c = 4-8025 X 10~ 10 e.s.u., and the unit of mass to be 
the rest mass, m, of the electron (9*1067 x 10~ 28 gm.), we find that r 0 has 
the value 0*529 x 10~ 8 cm. 

From these quantities we derive the unit of energy 

€ = e 2 /r 0 = me 4 /^ 2 = 4*354 x 10 ~ u ergs = 27*20 eV. 

t See, for example, H. Weyl, Qruppentheorie und Quantenmechanik (Leipzig, 1928; 
Eng. trans. London, 1931); C. Eckart, Be.v. Mod. Phys. 2, 305 (1930); E. Wigner, 
Qruppentheorie (Braunschweig, 1931); B. van der Waerden, Qruppentheoretinche Methode 
in der Quantenmechan ? k (Berlin, 1932). 

J D. R. Hartree, Proc. Camh. Phil. Soc. 24, 89 (1928). 
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The energy e is twice the ionization potential of the hydrogen atom in 
the ground state. The unit of velocity is automatically fixed by the 
relation \mv\ = c once the units of energy and mass are fixed. It turns 
out to be the velocity of the electron in the ground state of the hydrogen 
atom (again, in the Bohr theory); i.e. we have as the unit of velocity 

v 0 = e 2 jh = c/137-0 = 2-188 X 10 8 cm./sec. 

The unit of time is then chosen to be 

t 0 — r 0 /v 0 = 2-4188 X 10- 17 sec. 

In terms of atomic units the SchrOdinger equation reduces to the simple 
forra VY+2 (W~V)<f> = 0. (1) 


27.2. The wave junctions of many-electron atoms 
Employing atomic units we may express the Schrudinger equation 
for a neutral atom containing Z electrons in the form 


{i| y?+(w-v))t = o, 

where Vf denotes the Laplacian operator 

a 2 a 2 a 2 


( 2 ) 

(3) 


(Xpy^Zj) being the rectangular Cartesian coordinates of the jth electron 
referred to the nucleus as origin. For the potential energy in the same 
units we have 

F--V-+S'-, (4) 

T jk 


Z „ Z , 

- >-+ >- 
J>k r * 


where denotes the distance of the jth electron from the nucleus and 
r,j k the distance between the jth and kt\\ electrons. The Schrodinger 
equation (2) can seldom be solved exactly, but by making certain 
assumptions about the nature of the potential energy V approximate 
solutions of equation (2) may be obtained. Of these assumptions the 
simplest is that it is possible to construct a potential function of the 
type 


V = 2 F(r,), 


(5) 


which is a good approximation to the actual potential function V of 
equation (4). This is equivalent to considering the mutual repulsions 
of the electrons to be averaged out in such a way that each electron 
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moves in a static field which is independent of the positions of the other 
electrons. Substituting from equation (5) into equation ('!), we obtain 

Z{lVf -V(rj)}<{, I lly, - 0. (0) 

1 

An equation of this type may be solved by setting for ifj a product of 
n wave functions, ^(r ; .), each of which is a function of the coordinates 
of one electron only. Thus if ^(iq) is a solution of the wave equation 

[Ff+{F 1 -F(r 1 )}]^ l (r 1 ) = 0, (7) 

and if ^ 2 (r 2 ), 0 3 (r 3 ),..., \p z (r z ) are defined similarly, it can readily be 
shown that the product 

ip — ^ 1 (r 1 )0 2>{T2i*"'pz( T z) 

is a solution of the Sclirodinger equation (6) corresponding to the energy 

W = W 1 +W t +...+W z . 

If we now introduce the electron spin, we may associate either of 
the two spin wave-functions yjcr,.), xpfoj) with each of the spatial wave- 
functions ip(Tj) so that the spin is considered as doubling the degeneracy. 
To denote that the wave functions are dependent on the four variables 
x, y, z, a for each electron we introduce a vector q y to denote the four 
quantities r jf uj. The wave functions corresponding to the jth. electron 
are then written symbolically 

*(q j) - WxK). (8) 

Thus if we include the electron spin there will be at least two (and, in 
the case of degenerate functions i/j , several) wave functions <p corre¬ 
sponding to each of the eigenvalues W 1} W k ,... of the parameter W 

of equation (7). The <j> wave functions corresponding to the eigenvalue 
W k of the energy may be denoted by (j> < ^ ) ) where the parameter p, ranges 
over the number of values appropriate to the degree of the degeneracy. 
A solution of the wave equation (6) is obtained by choosing any Z of 
the functions substituting in them the coordinates of the electrons, 
and forming the product. Thus, if from the class of functions (p^ we 
select a finite set which we shall denote simply by 

then <p 12 z defined by 

is a solution of the wave equation (6) corresponding to the energy 

w - W 1 +W 2 +...+W z . ( 10 ) 

K 


5072 
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There are, of course, several ways in which the energy W can be 
distributed among the electrons, for instance, the wave-functions 

^ 12 ...z ~ 

and ^ 2 i...z “ ^ 2 ( < 1 i) < ^i(Q 2 )”*^z( ( 1 z) 

are quite distinct, although they correspond to the same value of the 
energy. In fact it is easily seen that there are Z\ distinct wave functions 
corresponding to the same value of the energy. If P denotes any 
permutation of the suffixes 1 , 2 ,..., Z , then the wave functions 
of such a set may be written symbolically as 

~ -^{ IT <£/<;( <!&)}• 

Furthermore, if a P is any constant, then the wave function 

P P 1 k 1 

where the summation is taken over the Z\ different permutations 2 J , is 
also a solution of the wave equation (6) for the same value of the 
energy W. Taking a v to be +1 for even permutations and — 1 for odd 
permutations we obtain the solution, anti-symmeirical in the q’ s, 

^i(Qx) fc(qi) • • • <f>M 

<f>M <f>M • • • M q*) , 11X 


I (Qz) • • • ^z(Qz) I 

This is the form that has to be used in the solution of problems involv¬ 
ing atoms and molecules. The interchange of two electrons leaves <M>* 
unaltered, as it must since it is impossible for us to distinguish between 
two electrons, and <I> vanishes, in accordance with the Pauli exclusion 
principle, if two of the electrons have their q’s equal, f A wave function 
written in this way to include space and spin coordinates alike has 
many advantages over wave functions constructed in the space coordi¬ 
nates and the spin coordinates separately, especially in that it avoids 
many applications of the theory of groups which were a necessary 
consequence of this method of attack. 

Some authors write the determinant (11) in the form 
$ = Det{^ 1 (q 1 )^ 2 (q 2 )...^ z (q z )}, 
exhibiting only its leading term. 

There may be several wave functions of the type (11) with the same 

t P. A. M. Dirac, Proc . Roy. Soc. A, 112, 661 (i 926). See also .T. C. Slater, Phys. Rev . 
34, 1293 (1929). 
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energy. For example, the excited states of a two-electron atom may 
be described by wave functions of the type 



l Am( r l)Xa( <T l) 

lA»( r l)Xa( ff l) 


<Pm( r 2)X<>2) 

<An( r a)X^( CT 2) 


l Am( r l)Xa( or l) 


2 

'l'm( r t)XeX a ») 

^n( r a)XpM 

<i > 3 = 

>Pm( r i)Xp( a l) 

'l'n( r l)x<x( a l) 


&n( r 

&.( r *)Xe«K) 

<t>, — 


<An( r l)XjSK) 

4 

&»( r t)xd?t) 

^»(r 2 )x/ 5( <7 a) 


each corresponding to the energy W m +W n . 

As long as we assume the potential to be of the type (5) any linear 
combination of the <h’s will be a solution of the SchrOdinger equation. 
But if we use the correct form (4) the degeneracy will disappear either 
wholly or in part, and a first order approximation to the solution will 
be given by particular linear combinations of the O’s. In the case of 
the helium atom, for instance, the approximate wave functions are 
0 3 , or any linear combination of 0 lf <h 4 , and 0 2 +4> 3 . 

27.3. The electron distribution function of a many-electron system 

In the non-degenerate case the approximate solution of the Schro- 
dinger equation is a single determinant. In the degenerate case the 
approximate solutions are a restricted set of linear combinations of 
determinants. Even in this case, however, it is usually possible to 
choose some of the solutions to be single determinants. In this section 
therefore we shall consider the electron distribution when the solution 
is a single determinant. 

If the wave function O of a many-electron system is expressed as 
a determinant of the form (11) and, as is consistent with the results 
of § 27.2, the functions <f> are orthogonal to each other and normalized 
to unity, then the probability of finding one electron in the volume 
element dr v another one in the volume element dr 2 , and so on, is 

pdq 1 dq 2 ...dq z . 

Here dq = drdo includes both space and spin coordinates and 

^i(Qi) • • • fiziQi) ‘MQi) * • • ^z(Qi) 

. • . 4&<u) x &(q.) • • • Mq.) 

• • • ^z(qz) ^i(q,z) * • • 
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If we introduce the notation 

pu ~ IPMMMi), 

the expression for the distribution function redu< 

/hi P12 Piz 

P21 P22 P2Z 

P = 


I Pzi PZ 2 • • • Pzz 

The probability of finding m electrons in a certain prescribed configura¬ 
tion irrespective of the position of the other (Z ~?n) may bo obtained 
by integrating p over the coordinate spaces of the remaining Z—m 
electrons. The integrations*)* must be carried out in such a way that 
each configuration appears only once; that is, dq m + 1 dq m]2 ...dq z must 
be replaced by dq m+1 dq m+2 ... dq z \(Z—m)\ 

Since the functions <f> are normalized to unity we have 

f Pzz^qz = 'L f ^tiQzWkiQz) dqz = 0^) 

J &=1 J 

Similarly, by making use of the fact that the </>\s form an orthogonal 

set, we can show that r , 

J PizPzj dq z — Ply (15) 

Now if we denote by p z _ x the determinant formed from p by removing 
the last row and the last column, and by A^_ x the cofactor of p, tj in 
the determinant p z ~ v then expanding the determinant p in terms 
of the last row and column we obtain 

z z 

P “ PzzPz- 1 ”" 2 2 PizPzi^z-v 

i=i 1 

Integrating over the variables of the Zth electron and making use of 
equations (14) and (15) we obtain 

J P dq z = Zpz-i~~ (-£—1 )pz~i = Pz- 1 > 

showing that the effect of integration with respect to the coordinates 
of the Zth electron is merely that of removing the last row and column 

f In the integrations we use dr i to denote dx^ dy i dz i and dq i for the volume element 
including spin. The integration over the spin variables is merely a sum: thus 

f dq { « f dx i dy i dz i 2 • 

J J «,& 


( 12 ) 

to the form 

(13) 
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from the determinant. Carrying out the integrations with respect to 
the coordinates of the other electrons we obtain the relations 

J P dq z -idq z = 2p z _ a , 

J P <ks ~dqz ~ (Z— 

J P dq 2 ...dq z = (Z —l)!p n . 

As previously explained, the probability of finding any one of the 
electrons in the volume element dr l is given by the sum of the distri¬ 
bution functions corresponding to the one-body problem, i.e. 

dr i j = ^ J^*( r i)^*( r i)^ T i- 


27.4. The energy of a many-electron system 

The Schrodinger equation for a system of Z electrons moving in the 
field of a nucleus of atomic number Z may be written in the form 

(L—W)® = 0, (16) 

where, in atomic units, 



*v? + |). (18) 

If we multiply both sides of equation (16) by O*, substitute for <1> from 
equation (11), and integrate over the whole of the configuration space 
(i.e. over spin as well as spatial coordinates), we obtain the relation 


and 


IL 


H 


w 




(19) 


where p is defined by equations (12) and (13). It should be noted that 
in this equation the operator 11 j must be taken to act on p tj but not 
on pj •; this is a consequence of the fact that Hjp is formed from the 
product <!>*//,•<!> in which Hj operates on <!> but not on The integra¬ 
tions can be performed by methods similar to those outlined in the 
preceding section. For example, we have 


1 

(Z-2)! 


f pjk _ 

f J_ Pa 

Pik 

J r ik J 

1 r jk Pkj 

Pkk 


dq.jdq k . 


( 20 ) 
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From the definition of the quantities p we see that the right-hand side 
of this equation is equivalent to 


z z 


22 f 

A~i /x-i* 

Z Z r 

22 ( 21 ) 

r~I J r ik 


A=lfi=l* 


Since only certain of the orbital wave-functions cf>\ will have the same 
spin factors, they alone will give a non-zero result when the integrations 
over the spin variables are performed; the remainder are orthogonal 
and the integrals involving them vanish. Thus, if the spin wave func¬ 
tions are normalized to unity and if we write for the integrals over the 
spatial coordinates 

(«&|0|«J)= f ra(rM(rk)^Urj)M r K) (22) 

J r jk 

we find that the first term of (21) may be written 


i (AHGIV). 

A,/x=l 

In the second term of (21), on the other hand, certain of the terms will 
vanish; it reduces to z 

s (/xAIG'IV), 

where the summation symbol S implies that the sum is taken over 

only those pairs of numbers A, p for which the corresponding orbital 
wave functions <f)\ and ^ have the same spin factor. Furthermore, 
since there are \Z(Z—1 ) lines passing through Z points we have that 


2 / 

j>k J 


= \Z(Z- 1) f p —^, 
J r jk 


'jk 


so that finally 

2 f -- = UZ'Ml (VK?IV)~ «0*A|G|V)1. 

f>k J > k v ^ 


(23) 


The integrals (ab\G\cd) occurring in equation (23) and defined by 
equation (22) are known as generalized exchange integrals. 

The integrals involving p and Hj only may be evaluated in a similar 

fashion: we obtain r 

J P dq=Z\ 

_ c x , (24) 

\\lI jP dq = Z\y (A|//|A)J 

j J A^l J 

(A|#|A) = f MrJHjipx( r j) dr p (25) 


where 
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drj denoting as usual the volume element of the spatial coordinates. 
Substituting from (23) and (24) into equation (19) and dividing through¬ 
out by Z! we obtain finally for the total energy of the system 

w = a |(A|H|A)+|( 2 (AHW-^iGM), (26) 

where the summation S is taken over those pairs of spatial wave 
A ,fi 

functions an( l which have the same spin factors in <h. W can be 
calculated from equation (26) only if the orbital wave functions of the 
system are known. Equation (26) shows, however, that the energy of 
a many-electron atom is made up of three components: 

(i) the sum of the energies of each electron in the presence of the 
nucleus alone; 

(ii) the sum of the Coulomb interactions of electrons in one orbital 
with those in another; 

(iii) the sum of the exchange interactions between two electrons in 
different orbitals but possessing the same spin. 

28. Variational methods of determining wave functions and 
energy values of many-electron atoms 
28.1. Fode’s variational equations 

The Schrodinger wave equation (16) may easily be shown to be 
equivalent to the variation principle 

8 J 0*(//— Jf )0 dq = 0, (27) 

8 denoting an arbitrary change of O subject to the normalizing condition. 
The integral is taken throughout the configuration space of all the 
coordinates (including those describing the spin) of all the electrons. 
By the methods of the calculus of variations we can establish that 
equation (27) reduces to 

| m*(H-W)<S> dq = 0, 
which yields the variational equation 

If J 80*0 dq = J 80*170 dq. (28) 

In the many-electron problem <I> is a determinant of the form (11), 
so that equation (28) takes the form 

= ZHf | J Bip* </r„ dr, (29) 
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where p is defined by equation (13). As before, the quantity II^ in 
equation (29) operates on the coordinates of the jth electron when they 
appear in terms such as p^ but not when they occur in terms of the 
type p j{ . Contrariwise, operates only on p {j and not on p j{ . With 
this convention we have 


Pll 

P 12 

• • HjPij 

PlZ 

P 21 

P 22 

. . H jPij . 

P2Z 

&ipil 

&iPi2 • 

. . $£ Hj pij 

• • $iPiZ 

Pzi 

PZ2 

• • HjPzj 

* • Pzz 


and performing the integrations as in § 27.3 we find that if i / j 


J 8 { H jP dr = (Z—2)l J | d qi dq p 

which may be reduced to the form 
J 8 i Hjp dr 

= (Z- 2)l( I (m\H\m) J 8ijj p {r dri— S (ra|i/|p) J S0*(/r m d^J. 

K m,v*= 1 J m,p J 1 

Similarly jS t Ht P dq= (Z- 1)! 2 / S0*(r<)£ty(r«) dr it 
so that finally 

J (?8«)(p,)^ 

Z r ( (7?) Z , 

= 2 J m[u+p- 8 {m\H\p)4> m + 2 (*«|# MlM dr, (30) 

p-\ J K m m=* 1 ' 

(P) 

where the summation S extends over all those i/j’s which have the 

TO 

same spin factor as in the expression for O. 

By a method similar to that employed in the last section for the evalua- 

J cZt 

—- we can establish that, for i ^j 9 i ^ k , 
- mil (lm\a\lm)~ 8 (lm\G\nd)\ f f dr - 

'Cm l.m - > n-1 J 

~(Z\)[ 2 ( lrn\G\ln)~ 8 {lm\Q\nl)\ f Ht^dr, (31) 

W,TO,n l,m,n > J 
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the summation 8 being taken over those sets of spatial wave functions 

l,m,n 

ifj which have the same spin factor in O. The result for i — j or i — k 
can be derived in a similar manner; it is found to be 


J = Z! %J J ^ m G mn (r)dr.. 

G mn (r) = j ^(r')-A„(r')|^ 


where 

and 


F(r) = I(?„„(r) 


(32) 

(33) 


►Substituting from equations (26), (30), (31), and (32) into the variational 
equation (29) we obtain series of terms of the type Since we 

may treat the variations 8i/j* as independent, we may take / = 0; in 
this way we obtain a set of equations of the type 

[H+V(r)]eMr)- n s(G mn (r)+(m\H\n)+ £ (lm\G\ln)}* m + 

+ /? (lm\G\nl)>p m = 0, (34) 

L,m 

where e n denotes the number of times, 1 or 2, ip n occurs in the wave- 
function €>. The system of linear equations (34) for the determination 
of the functions i/j n were first derived by Fockf by a different method; 
the derivation outlined above was given subsequently by Lennard- 

Jones.J 

in) 

Simple expressions for the summation symbols 8 can be derived 
in certain special cases. For example, if the atom consists of p pairs of 
electrons and q—p outer electrons possessing the same spin but different 
orbital quantum numbers, then when n refers to one of the orbitals in 
a closed shell (n < p) the wave function may be associated with either 
of the spin wave-functions x a , xp- I* 1 case > then, e n — 2 and 

(n) P a (n) P V Q Q 

8=1 + 2 . <8 = 2 1 + 2 2- (35) 

m m—1 m—1 l,m i=lm=l l—l m— 1 

On the other hand, if p < n < q, € n = 1 and 




1 


( n) 
8 = 
l,m 


<L Q 

-2 2 - 

1^1 m I 


( 3 «) 


By this method we have reduced the problem of solving the Schro 
dinger equation in 3 n independent variables to that of determining the 
solutions of a system of n partial differential equations each with three 

f V. Fock, Zeits.f. Physik , 61 , 126 (1930). 
j J. E. Lennard-Jones, Proc. Camb. Phil . Soc. 27, 469 (1930-1). 
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independent variables. The equations are not independent, but they 
lend themselves readily to solution by successive approximations. 


28.2, Hartree's method of the self-consistent field 

The simple method of calculating the field of an atom devised by 
D. R. Hartree and discussed in the last chapter for two-electron atoms 
may readily be extended to atoms containing many electrons. Each 
electron is assumed to possess a wave function which obeys a SchrO- 
dinger equation. The potential energy appearing in this equation is 
taken to be partly that due to the field of the nucleus and partly to 
the fields of the electrons in the other orbits. The latter fields are 
calculated by representing each electron by a uniform charge distribu¬ 
tion of density —etfnfj*, in accordance with the Schrodinger interpreta¬ 
tion of the meaning of the wave function 0. The ith electron thus 
moves in a field in which its potential energy is 


m = 



l^(r')l s 

|r—r'i 


dr', 


(37) 


where we have considered a system of Z orbital electrons moving in 
the field of a nucleus of atomic number Z. The wave function fa of 
this electron then satisfies the SchrSdinger equation 


[ V 2 +2 {W-Vfjlfa = 0 


(38) 


with Vi( r) given by equation (37). 

In applying this method to atoms containing many electrons Hartree 
has neglected the ‘exchange’ forces. Indeed Hartree s equations may 
be derived from the system of equations (34) by neglecting 

(i) the coefficients of every i[t m for which m =4 n; 

(ii) the non-diagonal exchange integrals (ln\G\nl). 

Fock’s equations (34) then reduce to 

(#+ g' ^(r)j^(r) = [(n\H\n)+ g' (lm]0\lm))^ n (r), (39) 

the dash denoting that the term l = n is omitted in the summation. 
It will be seen that {n\H\n) is the energy of the nth electron in the 
field of the nucleus alone, (ln\G\ln) is the averaged electrostatic 

energy due to the interactions between all the other electrons, and 
^ G u (r) is the potential of the nth electron due to the Coulomb fields 

of all the other electrons averaged over their respective probability 
charge distributions. These equations may be solved numerically 
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by smoothing out the fields G u ( r) so as to make them spherically sym¬ 
metrical, and adjusting them until the solutions of the equations for 
the ^-functions reproduce the potential field.f 

28.3. Solutions of the Hartree-Fock equations 

The method of the self-consistent field as developed by Hartreo and 
his collaborators is usually much easier to work with than the Fock 
method but, of course, it is not such a good approximation. In both 



Fkj. 19. Radial distribution of charge for Rb+ and contributions from different groups 
of core electrons. Radial density in electrons per atomic unit plotted against r in 
atomic units. 


methods the solutions of the basic equations can be obtained only in 
a numerical form. Solutions of the Hartree equations of the self- 
consistent field and of the Fock equations have been derived for various 
atoms and ions. One way of representing graphically the results of 
these calculations is to calculate for each atom the radial charge density 
for a number of distances and then to illustrate the variation of this 
density by means of a diagram. Such a diagram is shown in Fig. 19; 
it shows the variation with distance of the radial charge density for 
the ion Itb + and the contributions from the different groups of core 
electrons.^ In this diagram the distance r is measured in atomic units 

t D. R. Hartree, Proc. Camb. Phil . Soc. 24 , 111 (1927-8); see also J. A. Gaunt, ibid. 
328. 

t D. R. Hartree, ibid. 120. 
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and a different scale of r is used for r < 1 and r > 1. This is usual 
in diagrams of this type since a scale open enough to show the detail 
of the curves for r < 1 is unnecessarily open for r > 1 and the use of 
two uniform scales has seemed preferable to the distortion produced 
by a logarithmic scale. The regions of maximum charge corresponding 
to the K , L, and M shells are clearly evident on this diagram. There 
is, however, no peak corresponding to the N shell—only a flattening 
out of the curve; this is due to the fact that the contribution from the 
ilf-electrons is still appreciable in this region. 



The same features are evident in the curve (Fig. 20) showing the 
variation of the radial charge density for the chlorine ion Cl". For this 
ion both Hartree’s and Fock’s methods have been usedf and the curves 
corresponding to both sets of calculations are shown. It will be noted 
from this diagram that the two curves have approximately the same 
shape but that for large values of r they differ appreciably. Indeed, 
if we denote by hU{r) the difference between the radial charge densities 
in the two cases, then it can be shown that in the units chosen 

CO 

J r 2 8U(r)dr = 13-8, 

0 

and further that about 95 per cent, of this value comes from values 
of r greater than 3-4 atomic units, which is the 'ionic radius’:]; of Cl". 
It would thus be expected that the distortion of the wave functions by 
the presence of neighbouring atoms will be considerable in this region. 

t D. R. and W. R. Hartree, Proc. Roy. Soc. A, 156, 45 (1930). 
j W. L. Bragg, The Crystalline State , chap, vii, p. 115 (1934). 
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If we calculate the diamagnetic susceptibility, from these curves 
we find by the Hartree method (Hartree and Hartree, loo. clt) that 

x 4i :i io «, 
and by the Fock method that 

x = — 30-4X10* 6 . 

The observed value*)* is — 25 X 10" 6 , so that the use of the solutions of 
Fock’s equations has reduced the discrepancy between the calculated 
and observed values of y to about a third of the amount of the dis¬ 
crepancy when the results of the self-consistent field are used. The 
improvement which the solutions of the Fock equations makes to the 
calculated value of the electrical polarizability is also considerable. It 
seems then that the improvements effected by the use of the Fock 
equations—i.e. by the addition of ‘exchange' terms to the Hartree 
equations—are substantial. 

Solutions of the Hartree and of the Fock equations for other atoms 
and ions have boon obtained by several authors. The results are given 
in the papers cited below. No attempt has been made to divide these 
papers into two classes depending on whether the Hartree or Fock 
equations formed the starting-point of the investigation, since it not 
infrequently happens that in the same paper the solutions of both types 
of equations have been derived for purposes of comparison and also 
for determining the relative speed of convergence of both methods. 


P. R. Hartree 

Proc. Comb. Phil. Soc. 24, Ill 
(1928) 

Li, Rb f , Na+, CT~. 


Proc. Roy. Soc. A, 141, 283 (1933) 

Cl Cu+. 


„ 1 „ 143, 506 (1933) 

Cu+ K+, C,s. 


„ „ 150,96 (1935) 

A1+ 3 , Fo- Rb+. 


Phys. Rev. (2), 46, 738 (1934) 

Hg. 

P. R. Hartree and 

Proc. Roy. Soc. A, 139, 311 (1933) 

O in various stages 

M. M. Black 


of ionization. 

P. R. and W. Hartree 

Proc. Roy. Soc. A, 149, 210 (1935) 

Be, Ca, Hg. 


„ „ 150,9 (1935) 

Be. 


„ „ 156,45(1936) 

CP. 


„ „ 157, 490 (1936) 

Cu+. 


„ „ 164, 167 (1938) 

Ca++. 


„ „ 166,450 (1938) 

K+ A. 


Proc. Camb. Phil. Soc. 34, 550 
(1938) 

Na“, K“. 

J. MacPougall 

Proc. Roy. Soc. A, 138, 550 (1932) 

Si+ 8 . 

E. H. Kennard and E. 
Ramberg 

Phys. Rev. (2), 46, 1034 (1934) 

Na. 


t G. W. Brindley and F. F. Hoare, Proc. Roy. Soc. A, 152, 342 (1035); E. 0. Stoner, 
Magnetism and Atomic Structure, chap, ix, p. 4. 







142 ATOMS WITH MANY ELECTRONS § 28 


C. C. Torrance 

Phys. Rev. (2), 46, 388 (1934) 

C. 

L. P. Smith 

Phys. Rev. (2), 42, 176 (1932) 

He. 

V. Fock and M. J. 

Phys. Zeits. d. Sowjet. 6, 368, 

Na. 

Petrashen 

(1934) 



Phys. Zeits. d. Sowjet . 8 , 547 

Li. 

M. M. Black 

(1935) 

Mem. Manchester Phil. Soc. 79, 

Ag f . 


29 (1934-5) 

Mem. Manchester Phil. Soc. 79, 

Cr. 

M. F. Manning and J. 

75 (1934-5) 

Phys. Rev. (2), 49, 849 (1936) 

W 

Millrnan 

M. F. Manning and 

Phys. Rev. (2), 53, G62 (1938) 

Fe. 

Goldberg 

W. S. Wilson and 

Phys. Rev. (2), 47, 681 (1935) 

He (excited state). 

R. B. Lindsay 

H. L. Donley 

Phys. Rev. (2), 50, 1012 (1936) 

Si f2 , Si» 3 

R. L. Mooney 

Phys. Rev. (2), 55, 557 (1939) 

Cr++. 

W. Thatcher 

Proc. Roy. Soc. A, 172, 242 (1939) 

K+ 2 , K+. 

D. R. and W. R. Har- 

Phil. Trans. Roy. Soc. A, 238, 229 

O 

tree and B. Swirles 

(1939) 


A. F. Stevenson 

Phys. Rev. (2), 56, 586 (1939) 

He, OUT. 

A. Jucys 

Proc. Roy. Soc. A, 173, 59 (1939) 

C+4, C++. 

A. 0. Williams 

Phys. Rev. (2), 58, 723 (1940) 

Cu+. 

D. R. and W. R. Har- 

Phys. Rev. (2), 59, 229 (1941) 

Ge++ Go 

tree and M. F. Man- 

„ „ 60, 857 (1941) 

Si IV, SiV. 

ning 

M. T. Antunes 

Phys. Rev. (2), 62, 362 (1942) 

Co. 

D. R. Bates and 

Phil. Trans. Roy. Soc. A, 239, 269 

o-, 0 a -. 

H. S. W. Massey 

(1943) 



28.4. Analytical atomic wave functions 

One of the simplest methods of determining the wave functions and 
energies of atoms which possess only a few electrons consists in con¬ 
structing a wave-function 0 which has the correct symmetry properties 
but which contains certain parameters a, y,... which are left un¬ 
specified. The most favourable values of these parameters are then 
determined by the variation method as follows. The function is substi¬ 
tuted in the equation 

f dq 

(40) 

J <E>*4> dq 

and the values of the parameters determined so as to make the energy 
W a minimum. In this way we obtain the system of equations 

Ba Bj 3 By 


... = 0 


(41) 
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for the determination of the parameters a, /?, y,... . Wave functions 
constructed by this method have proved to be of great value in the 
approximate calculation of the properties of atoms containing many 
electrons. 

This method was used by Hylieraasf to give a very good approxima¬ 
tion to the observed ionization potential of helium. The form of wave 
function assumed by Hylleraas was 

corresponding to a helium atom in its ground state. 

The wave functions for the atoms Li, Be, B, C, N, 0, F, Ne have 
been derived by this method by Zener and Guillemin.J For the ls- 
orbitals Zener assumes a wave function of the form 



1 

1! 

e 

(42) 

for the 2s-orbitals O 

— g 2 (r—a)r n ’- 2 e- hr . 

(43) 

and for the 2p-orbitals 

0> = 


(44) 


Here 1^(0,^) denotes the surface spherical harmonic of the first degree, 
^( 3 / 877)801 de^t. The function (42) is of the same type as the wave 
function for the hydrogen atom in the ground state; the parameter y 
may be regarded as allowing for the screening effect of the other 
electrons round the nucleus. For each atom the four parameters n*, 
oc, y, 8 are determined from the equations (40) and (41). The parameter 
n* has been varied only in the cases of Li and Be; since for both of 
these atoms it has the hydrogenic value 2-0, Zener assumed it to retain 
this value in the other cases oonsidered. It is found that for an atom 


of atomic number Z the parameter y is given approximately by the 


formula 


y = s-*. 


(45) 


The values of the remaining parameters are given in the accompany¬ 
ing table; by the use of these values the energies of the L-sliell were 
determined with an error of one or two electron volts. 


Table 1 



t E. Hylleraas, Zeits.f. Phyaik, 65, 209 (1930). 

j V. Guillemin and C. Zener, ibid. 61, 199 (1930); C. Zener, Phya. Rev. (2) 36, 51 
(1930). 
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Simple rules for obtaining the values of the four parameters y, a, n*, 8 
for most of the atoms in the periodic table have been given by Slater.f 
The charge densities derived from the functions (42), (43), and (44) with 
Slater’s values for the parameters agree fairly well with those found 
by Hartree’s method of the self-consistent field in the cases where that 
method has been applied. 

For some purposes there is not sufficient flexibility in the Zener- 
Slater wave functions, and, in addition, the separate orbitals are not 
all mutually orthogonal. The method has been improved by Morse, 
Young, and HaurwitzJ and by Duncanson and Coulson§ to remove 
these defects. These authors take the wave function of an atom to be 
the appropriate sum of determinantal wave functions in which the 
wave functions of the component orbits are assumed to be 

tp(ls) = ^(ii?a 3 l7T)e~i iar , 
ip(2s) = (3Y 2 //x)e~^ r }, 

ifj(2p 0 ) ~ J((i 5 c 5 lrr)r cos 9 e~i Lcr , 
ip(2p ± ) = , N /(/x 5 c 6 /277)rsin0e”^ cr±? ^. 

The numbers N ± and N 2 are numerical constants; N 2 is determined by 
the condition that the wave-functions 0(2s), are orthogonal and 
N t is a normalizing constant. The energy is minimized to determine 
the remaining four constants a, b , c, /x; if we are concerned solely with 
atoms in their ground states an absolute minimum of the energy exists. 
The results obtained by Coulson and Duncanson for the ground states 
of the atoms Li to Ne are shown below. The quantity 2/x may be 

Table 2 


Element 

ay. 

V 

cy 

/* 

Slater's 
value of y 

Li 

2-69 

2-26 

.. 

0*658 

0*650 

Be 

3-69 

3-28 

.. 

0*979 

0*975 

B 

4-69 

4-20 

1-20 

1-322 

1*300 

C 

6-69 

513 

1-66 

1-652 

1*625 

N 

6-68 

608 

1-91 

1-970 

1*950 

(> 

7-68 

7*06 

2*22 

2-289 

2*275 

F 

8-67 

8*02 

2*55 

2-611 

2*600 

Ne 

9-66 

8-97 

2-88 

2-934 

2*925 


regarded as the effective nuclear charge for the 25-electron at large 

t J. C. Slater, Phys . Rev. (2), 36, 67 (1930). 

J P. M. Morse, L. A. Young, and E. S. Haurwitz, ibid. 48, 948 (1936). 

§ W r . E. Duncanson and C. A. Coulson, Proc . Roy. Soc. Edin . 62, 37 (1944). 
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distances, and is seen to increase by about 0*35 for each extra electron. 

The formula ,, „ ,. 

fi = \{Z- 1) 

gives n in terms of the atomic number Z to an accuracy of about 2 per 
cent. 

29. The perturbation method of determining approximate wave 
functions of atomsf 
29.1. Non-degenerate cases 

In addition to the equations of Fock and Hartree, other approximate 
methods of determining the wave functions and energy levels of many- 
electron atoms have been devised. Suppose, for example, that the 
Sohrodinger equation (in atomic units) for an atom containing many 
electrons is written in the form 

(H— W)<b - 0, (46) 

where II denotes the Hamiltonian 

//- -ifvf+F, (47) 

.7 = 1 

Z being the number of electrons. If the equation (46) cannot be solved 
directly, it is often possible to obtain an approximate solution from the 
solution of an equation which is separable and which does not differ 
greatly from (46). Suppose that the wave functions and characteristic 
values of the Schrodinger equation 

(H'-W)<t> = 0 (48) 

z z 

are known, where H' — — £ % Vf + % Vj. (49) 

7=1 7=1 

Here each of the functions V] is a spherically symmetrical function 
of iy If a non-degenerate solution of the equation 

0 (50) 

can be obtained, the solution of equation (48) can be written as a 
determinant whose elements are solutions of equation (50); in most 
physical problems there is thus more than one wave function <& k for 
every characteristic value W k of the wave equation (48). For simplicity 
we shall, however, assume, in the first instance, that there is only one 
solution O*. to each value of W k and consider the degenerate case later. 
In the non-degenerate case the functions then form an orthogonal 
set. Equation (48) may still have non-degenerate solutions even if the 

f J. E. Lennard-Jones, Proc. Roy. Roc. A, 129 , 598 (1930); Journ. Lond . Math . Roc. 6, 
290 (1931). 

6072 


L 
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solutions of (50) are degenerate. Provided the only degenerate solutions 
of (50) used in constructing O are complete sets, then O will be non¬ 
degenerate; e.g. for the helium atom in the ground state we construct 
O from the two 16*-wave functions, and there is no other way of con¬ 
structing (1> to give the same energy. Moreover, provided the solutions 
of (50) are chosen to be orthogonal, as they always can be, the functions 
4> will be all orthogonal whether they are degenerate or not. 

Let us write J OffydT == (&(1|Z), (51) 

where the integral is taken over all the coordinates, spatial and spin, 
then if the wave functions <& k are normalized to unity (&|1|J) is merely 
the Kronecker deltaf S A:/ ; and (&|11 1) always vanishes for k 1 . 

To obtain the approximate solution of equation (46) we observe that 

z 

we have supposed equation (48) to be such that 2 K' d° es no ^ 

i 

greatly from V; we may thus write equation (46) in the form 

(tf'-W)® = -v®, (52) 

where v = F— 2 V] (53) 

j^i 

is small. It is assumed that the solution of equation (52) can be ex¬ 
pressed as a linear combination 

® = 2 a k®k 

k 

of the solutions of equation (48). Substituting this series in equation 
(52), multiplying by Of, and integrating over the whole range of all 
the variables specifying the wave functions, we obtain a system of 
linear algebraic equations 

|fl*{(l|£r'|i!)-ir(i|l|t)+(*M4)} = 0 (1=1,2,...) (54) 

for the determination of the coefficients a k and the possible values of W. 
In these equations we have written, in the usual notation for matrix 
elements, 

(l\A\k) = j<t>?A® k dq. 

It follows immediately from this definition and from that of //, H', v 

that (i|fr|*)+(i|®|*) = (i|£T|i), 

which is the matrix element of the Hamiltonian of the equation (46) 
with respect to the wave functions of the SchrOdinger equation (48). 

t The Kronecker delta is defined by the relations = 0 (k ^ l) t h kl6 — 1. 
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The system of algebraic equations (54) may then be written in the form 
|fl lk {(Z|J5r|i)-iy(i|l|i)} = 0. (55) 

In the general case the wave-function O is an infinite sum of terms of 
the form a k <S> k so that, in general, the system (55) consists of an infinite 
number of equations in an infinite number of unknowns. Eliminating 
the coefficients a k from the set of equations (55) we obtain the determi- 
nantal equation 

(l|JET|l)-fP(l|l|l) (1 \H\2>) (l|ff|3) . . . 

(2\H\l) (2\H\2)—W (21112) (2|//|3) . . . 

m\l) (3|H|2) (31//[3) W (3|113) . . . ' " 


In the case where the wave-functions <I> /c are normalized to unity, 

(Z|1|A) -- S lk ; 

also (l\Il'\k)^W k S lki 

where W k are the eigenvalues of equation (48). The determinantal 
equation then reduces to 


6? 

I 

T 

r—l 

IT 

pH 

(1|»|2) 

(1|«|3) • • • 


(2|®|1) 

(2|»|2 )-(W-W 2 ) 

(2M3) . . . 

= 0. 

(3|«|1) 

(3|®|2) 

( 3H3)-(JF-JF 3 ) . . . 

. 


. 

(56) 


Since the number of equations of the system (56) is infinite, the determi¬ 
nant on the left-hand side has an infinite number of rows and columns. 
The eigenvalues W k are known, and once the functions <& k are determined 
the matrix elements (l\v\k) can be determined by quadratures, and the 
values of W found. Suppose, for definiteness, we wish to determine the 
change in the value of As a first approximation we assume 

(l\v\k) = 0, 

except when l = 1, k — 1, so that 

The second approximation is obtained by assuming (Z|?’|fc) to be zero 
except when either l or k is 1. Then 

I (i|„|i)_ ( nr— W5) (l M2) (1 M3) . . . 

(2\v\l) W 2 -W 0 ... 

(3|t>|l) 0 W A —W ... * 


0 = 
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whence we obtain the equation 

W = Tf x +(1N1)+J ^jj^) (57). 

]¥* 1 ^ 

for the determination of W. More accurate solutions are obtained 
by including more of the elements {l\v\k) in the expansion of the 
determinant. 

When the energy values have been determined by some numerical 
method, the corresponding wave-functions 

$ = I a k®k 

are obtained from the solution of the set of equations (50). The wave 
function can thus be written as a determinant of the type 


<Di 

$2 

... 

(2|t>|l) (2\v\2)-(W-W 2 ). 

. . (2\v\ m) 

(m|u|l) 

(m|?>|2) 

. . (m\v\m)~{W—W m ) . 


29.2. Degenerate cases 

We now consider the case, which occurs most frequently in practice, 
in which there is more than one wave function corresponding to 

each value of the energy W k . Suppose 
9 O’ there are a such functions associated with 
the same eigenvalue W k of the energy; 
denote these wave functions by 

(58) 

If we denote by A k the square of a 2 ele¬ 
ments corresponding to the set of wave 
functions (58), then the determinantal 
equation (56) assumes the form shown in 
Fig. 21, where A k + V etc,, are the square 
arrays corresponding in a similar way to the energies W^+i» etc. It should 
of course be noted that the elements of the determinant outside the 
shaded squares do not in general vanish. 

The determinant A k can be transformed to one in which all the 
elements are zero except those on the diagonal by a method similar 
to that of changing the axes of a quadric in higher space. Since the 
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wave functions ®J(i — 1,2,..., a) are degenerate it follows that the 
wave function a 

= (5») 

i—1 

formed from a linear combination of the functions OjL will also corre¬ 
spond to the energy value W k . The function (59) will be orthogonal to 
a similarly constructed wave-function 

(so) 

i —1 

provided that, when g is not equal to h, 

2 PihPiff — 0. 

1 

Thus if we can determine a set of a 2 coefficients fi ih such that 

2 Mio = o, 

i~ 1 

the new set of wave functions form an orthogonal set. With the 
set of wave functions so constructed the determinant A k becomes a 
diagonal one. Similar transformations can be carried out for the square 
matrices A k + V giving finally an equation of the type of Fig. 21 but 
now with the matrix A k given by 

,(i\v\i y-(w-w k ) o ... o \ 
A l 0 ( 2 \v \2 y-(W-W k ). . . 0 


NO 0 . . . (<x |C7 |oc) # —( IV — H^)/ 

the symbols (g\v\h)' denoting the matrix elements 
(g\v\h)’ = J <&l* 0 v<S>l h dq 


and A k+V .. of similar type. 

The transformed determinantal equation may then be solved approxi¬ 
mately by neglecting all the elements except those on the leading 
diagonal and successive approximations obtained by a variety of 
methods depending on the relative magnitudes of the elements neglected 
in the first step. Thus as a first approximation for the change in W k we 


obtain 


W = w k + / 


where g — 1 , 2 ,..., a; as a second approximation we may omit every 
(k\v\l) except those within the squares A k , A k+li ..., and those of one 
row and column which intersect on the diagonal, and so on as in the 
case of non-degeneracy. 
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Since the potential function V in a real atom consists of electrostatic 
terms, it should be noted that 


i r i i r ik 


j J k<j 

so that the diagonal elements of the determinant on the left-hand side 
of equation (58) are given by expressions of the type 


dq, 




Pll 

Pl2 

• 

• PlZ 

-ll 

f" 

P2 1 

P22 

• 

P2Z 

Z'J 

Pzi 

PZ2 

• 

• Pzz 


which can be evaluated to give 

~l‘,*■*+',Hi 


dq } dq k 


Pjj Pjk 
7 jk\ Pkj Pkk 

and can be expressed in terms of (lm\G\lm) and (lm\G\ml) as before. 

To illustrate the general principles of the method by which the fore¬ 
going analysis may be used to obtain detailed information about the 
properties of real atoms, we shall outline the procedure for a particular 
case, that of carbon, in which there are six electrons. If each electron 
moved in the same field, assumed to be spherically symmetrical, the 
wave functions would be of the form 


0„,,,m = fnjfr) /?(COS 0)ey"t, (61) 

n assuming integral values 1, 2,... and \m\ ^ l, l ^ n— 1. We thus 
obtain the wave functions including spin 

«=1» J = 0 0 1.0.0 Xo 01,0,0 Xp 

n = 2, l — 0 02,0,0 Xa’ 02,0,0 Xp 

n — 2, l = 1 02,1,1 Xa> 02,1,1 Xfl’ 02,1,0 Xct' 02,1,0 XfS 

02,1,-lXo 02,1-1 XP 

corresponding to the lowest energies. 

The six wave functions for the construction of the wave function of 
the system of six electrons may then be taken as the first four of these 
wave functions together with any two of the last six (all of which 
correspond to the same energy). Corresponding to the lowest energy 
we can thus construct 6!/4!2! = 15 wave functions each in the form 
of a determinant with six rows and six columns. We can construct 
similar wave functions for higher energies and then employ these wave 
functions as a basis for determining the properties of the real atom. 
The matrix elements of the Hamiltonian of the real atom are calculated 
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with respect to these wave functions and the determinantal equation 
constructed in the manner described above. The fifteen degenerate wave 
functions lead to a square matrix of fifteen rows and columns, and the 
first step in the calculations is to determine such linear orthogonal 
combinations of the wave functions as will transform this square matrix 
into one with diagonal elements. This procedure is repeated for the 
other square matrices A k and the roots of the determinantal equation 
determined by the method of successive approximations outlined above. 

This cumbersome procedure can, however, be avoided by a method 
due to J. C. Slater based on the above analysis; this will be described 
in the next section. 

29.3. Slater's method for complex atoms j* 

Slater’s method of determining the values of the energy levels of 
complex atoms is based on the fact that the wave function (61) repre¬ 
sents, not only a state of definite energy, but also one with definite 
angular momentum. The angular momentum about the z-axis 6 -- 0 
of a single electron defined by the wave function (61) is m measured 
in atomic units (mh in ordinary units). In the carbon atom we suppose 
as a first approximation that the six electrons move independently in 
the same central field and that the angular momentum of each electron 
is given by the value of m in the appropriate wave function. By 
assuming that the six electrons are moving in the same field we obtain 
the total angular momentum (in atomic units) merely by adding the 
m’s appropriate to each of the electrons; thus denoting this quantity 
by M z we have (62) 

The field in which the electrons move is not, of course, spherically 
symmetrical. The electrons exert forces on one another and it is no 
longer possible to determine the angular momentum of the individual 
electrons. These forces cannot, however, affect the total angular momen¬ 
tum; the value of M z given by equation (62) therefore remains unaltered 
despite the interactions of the electrons. 

In a similar way the total angular momentum due to the spins of the 
electrons is given by the formula 

$ = ( 63 ) 

where s denotes the angular momentum in the z-direction due to the 
spin of a single electron moving in a spherically symmetrical field. In 
atomic units 8 assumes either of the values ± 

t J. C. Slater, Phys. Rev. 34, 1283 (1929). 
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The values of M z and S z for the fifteen wave functions corresponding 
to the lowest energy of the carbon atom are shown in Fig. 22, the 
numbers of wave functions with the same values of M z and S z being 
shown in the circles which are joined to points whose coordinates are 
the appropriate values of S 2 and M z . 

Now the Hamiltonian operator is not changed by a rotation about 
the 0-axis so thatf the angular momentum commutes with the Hamil- 


A typical matrix element of M Z H is by the definition of a matrix 


product 


(m\M z H\n)^^{m\M z \l){l\H\n) 


= (m\M z \m)(m\H\n), 

since M z is a diagonal matrix and therefore (m\M z \l) — 0 if m L 
Also (m\HM s \n) = ^ (m\H\l)(l\M a \n) 

= (m\H\n)(n\M z \n). 

It follows at once that equation (64) is satisfied only if 

(m\H\n) = 0, 

when (m\M z \m) ^ (n\M z \n). Similarly it can be shown that (m\H\n) = 0 
when ( m\8 z \m) is not equal to (n\S z \ri). Thus the elements of the 
determinant on the left-hand side of equation (58) are zero, except 
when the two wave functions defining the matrix element have the 
same values of the momenta M z and S z . 

By means of this result and Fig. 22 we see that the matrix A 0 corre¬ 
sponding to the lowest energy of the electrons reduces to the form 



the dots denoting the 25 of the 225 matrix elements which are not 
zero; the matrix A 0 , thus consists of one square of three rows and 


t See equation (29 a), Chap. XII. 
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columns, two of two rows and columns, and eight diagonal elements. 
The roots of the determinantal equation 

Mol = 0 (06) 

may, or may not, be distinct. To determine which, if any, of these 
roots are multiple we consider next the total angular momentum M 
and the total spin momentum 8. 

Not only the component S z but also the total spin S is a constant 
of the motion. Furthermore, if the total spin is S, S z may assume any 
one of the 2$+l values 


8, S- 1, S- 2,..., -(£-2), — ($—1), 

and if we neglect the interaction of the spins with the space coordinates 
and the mutual interactions of the spins, to 
each of these values of S z there must corre¬ 
spond an equal value of the energy. Similarly 
the total angular momentum M is a constant 
of the motion and it may be proved that to 
each component M z of M there is related an 
equal value of the energy. It can also be 
proved that: (1) The eigenvalues of M 2 are 
Z/(X+1) where L is an integer. (2) To each 
eigenvalue of M 2 there correspond 2.L+1 in¬ 
dependent wave functions which can be 
chosen. 

The various groups into which the fifteen 
values of (S z , M z ) of Fig. 22 fall are shown in 
Table 3. In assigning a name to each of the three groups we have 
followed the language of spectroscopy in which a multiplet is denoted 
by 8 , P, D,... according as M = 0 , 1, 2 ,..., the multiplicity 2#+l 


2 

P 

P , 

© P 

0 _ 

P P 

o 

P P 


P 


Fig. 22. The S Z ~M Z diagram. 


Table 3 


M 

S 

Values of (S z , M z ) 

Name of group 

Energy 

0 

0 

(0, 0) 

1 S multiplet 


1 

■ 

(-1,-1), (-1,0), (-1,1) 
(0,-1), (0,0), (0,1) 

(1,-1), (1,0), (1,1) 

8 P multiplet 


2 

0 

(0,-2), (0,-1), (0,0) 

(0, 1), (0, 2) 

l D multiplet 

w 3 


due to the spin being written on the top left-hand side. Thus for 
M = 0 , 8 = 0 we write X S 9 etc. From this table it will be observed 
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that the values S 8 = 0, M z = 0 are common to the 3 P, multiplets 
and ^ = 0, M z = ^ 1 are common to 3 P, * 1 ). The table also shows 
that there are actually only three distinct values of the energy which 
we have denoted by w v w 2 , w Zi or, which is the same thing, that there are 
only three distinct roots of the determinantal equation (66), one being 
a simple root corresponding to the 1 S multiplet, another a ninefold root 
for the 3 P, and the remaining one a fivefold root for the 1 D. 

6-electron system ^Perturbation ^Atomic levels 


Fifteen coincident 
levels 



'S (one level) 

’D (five coincident levels) 


3 P (nine coincident levels) 


Fia. 23. The? lower energy levels of the carbon atom. 


So far we have made no approximation beyond assuming that the 
energies of the mutual interactions of the spins and the interaction of 
the spins with the space coordinates are of a lower order of magnitude 
than the energies being considered here. In considering the determi¬ 
nantal equation (Fig. 21), however, we introduce certain approximations 
by rejecting some of the elements. The determinant on the left-hand side 
of the equation is made up of a chain of squares grouped together along 
the diagonal and other elements lying outside these squares. As a first 
approximation we assume that these elements are so small that they 
may be neglected. The equation (Fig. 21) then splits up into two or more 
equations, one of which is |^4 | — o 

Now since the values S z = 0, M z = 0 are common to the l S> 3 P, 1 I) 
multiplets, it follows that the three-by-three matrix in (65) leads to a 
factor of the form 

(W—w x )(W--w 2 ){W-~w z ) - 0, (67) 

and the two-by-two matrices lead to factors 
(W-w 2 ){W-w z ) = 0 , 
and the complete determinantal equation is 

(W-w^W-w^W-Wz) 9 - 0 . 

It follow's that two of the roots of the equation can be found by a direct 
calculation based on the wave functions of the original six-electron 
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system. It is not necessary to calculate the wave functions of the atom 
itself, since these roots can be derived from a formula of the type (26). 

To determine the third root we make use of the fact that if we write 
the three-by-tliree factor of the equation \A 0 \ = 0 in the form 

— W+u n u 12 u V6 

U 21 — W+U22 U 2 . A = 0 , 

u 31 U 32 ^33 w 

it must be equivalent to equation (67). Equating the coefficients of W 2 
in these two equations we obtain the relation 

*>!+*>*+*>* = U u +U 22 + U 33 

for the determination of w 3 once w 1 and w 2 are known. Thus it is only 
necessary to calculate the diagonal elements of A 0 to determine all 
three roots w v w 2 , w. A . 

The method of Lennard-Jones and Slater sketched briefly above may 
be used to determine the energy levels of a complex atom in all cases 
in which the multiplets differ in type. If, on the other hand, there are 
two or more multiplets of the same type, say, 3 P multiplets, the above 
method yields only the sum of the energies of these multiplets. To 
evaluate the individual energies it is necessary to find those combina¬ 
tions of wave functions which make the appropriate square of the 
determinant diagonal. 

In conclusion, we summarize the main steps in the determination 
of the energy levels of complex atoms by the perturbation method; 
they are: 

(i) determining the wave functions of the electrons separately in 

z 

a spherically symmetrical field 2 

j=i 

(ii) deducing the approximate wave functions of the many-elcctron 
system in determinantal form; 

(iii) calculating the diagonal matrix elements of F— 2 K' 

i 

respect to these approximate wave functions. 

We indicated above that in many problems of physical interest elements 
of the infinite determinant lying outside the chain of squares strung 
along the leading diagonal are so small that they may be neglected. 
In the preceding analysis they were taken to be zero, but it is possible 
to obtain better approximations by taking them into account. Methods 
of doing so have been considered by Lennard - Jones, f 

t J. E. Lennard-Jones, Proc. Hoy . Soc. A, 129, 598-615 (1930). 
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30. The statistical method of Thomas and Fermi 

A semi-classical method of finding the density of electrons in a heavy 
atom has been formulated by Thomasf and Fermi4 This forms a fair 
approximation for heavy atoms only. The method is as follows: as in 
the method of Hartree, the electrons are pictured as moving in an 
electrostatic potential <D due to the nucleus and to the charge cloud 
of the electrons themselves. Electrons are supposed, much as in the 
theory of metals, to have all energies up to a maximum — eO 0 . The 
maximum kinetic energy is then, at any point (x, y, z), equal to 6(0—3> 0 ), 
and hence the maximum momentum of any electron at the point (x } y , z) is 

3W = V{2^e(<D-0> ff )}. (68) 

We now assume that a volume 8V can be chosen at any point containing 
a large number of wave-lengths (h/p) but such that O—O 0 is virtually 
constant within it. With this assumption, assuming that all electrons 
are in their lowest state and that they obey the Pauli exclusion 
principle, it follows that if n(x, y,z)8V is the number of electrons in the 
volume element Q # \ 3 

»<*,»,*) - Sr(ap)'. 

The 2 has its origin in the fact that two electrons are to be found in 
each quantized state. 

Substituting for p max from equation (68) we have 


n(x, y, z) — 




'(2me)K 


The potential function <1> is now determined by the Poisson equation 

Y 2 0 — —477/>, 

and p — — ne, n being given by equation (69). Thus finally we obtain 
for O the non-linear equation 

V 2 0 = «(<(70) 

where a = -^^(2me)K (71) 

The boundary conditions to be satisfied by the potential function O 
are readily determined. If the atom or ion is of atomic number Z, then 
for small values of r, is very nearly equal to Ze/r; thus the boundary 

oondition limr* = Ze (72) 

r-H) 

is valid. 

t L. H. Thomas, Proc. Camb. Phil. Soc. 23, 542 (1927). 

X E. Fermi, Zeits.f. Physik y 48, 73; 49, 550 (1928). See also L. Brillouin, UAtome de 
Thoma8-Fenni et la mithodz du champ ‘self -consistent' (Paris, 1934). 
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On the other hand, if the total number of electrons in the atom is 
N, then 

j n(x,y 9 z)dr = N, (73) 

where the integral is taken over all space; in the case of a neutral atom 

N= Z. 

Assuming radial symmetry and introducing new variables 

«- < Ze >‘ A ' * =< 74) 

we may write equation (70) in the simplified form 

*• 

- *»*-* (f > o). m 

The boundary condition (72) is now replaced by 

li™ </>(€) — !• (7«) 

Now the number of electrons in the volume between r ~ 0 and r is 


7 

z(r) — J 477 r 2 n(r) dr. 


z(r) 


V 2 <t>r 2 .dr = Z 


and, by Poisson’s equation, we can replace n by V ttc so tliat 

0 0 

Integrating by parts and introducing the boundary condition (70), we 
have / 


T 


z(r) = Zl !+£%-+), 


ud(f) 

; w 


whence 


1 _!<L> = 

Z 9 S 8$- 


(77) 

(78) 


This equation gives a simple method of determining z(r) graphically 
from the <f>-£ curve determined by the differential equation (75), the 
boundary condition (76), and a further boundary condition of the type 


lim^ 

f-H> dg 


= —ra. 


(79) 


Suppose that APP' is a curve determined in this way (cf. Fig. 24); A is 
the point (0,1), P is (£, </>). If we draw the tangent to the curve at P and 
produce it to cut the <f>- axis at B, then it is easily seen that the distance 
OB is equal to <£—and hence by equation (78) to 1 —z(r)/Z. 
Since OA — 1 it follows that AB ~~ z(r)/Z ; we thus have a simple 
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graphical method of determining the number of electrons within a 
sphere of radius r centred at the nucleus of the atom. 

The solution of equation (75) presents some difficulty; it is non-linear 
and involves a singularity at £ = 0. Furthermore, the boundary 
conditions cannot be satisfied directly since one must be satisfied at 



the singular point £ — 0 and the other at infinity. Since <f>' is infinite 
at £ = 0 it is not possible to use a Taylor expansion in the neighbour¬ 
hood of the origin; instead we use an expansion of the form 

<f> = 1— mg-\ a£*+b£ 2 +c(;i+d€*+.,.. 

With this form for the function cf> we have 

¥ = la£-*+2b+*ic£ h +M£+0(&) 

and - f^-|mf*+|af+0(f»), 

so that equating the two expressions we obtain 

a = h 6 = 0, c — —fra, d = \a = J. 

The coefficients of higher powers in the expansion have been determined 
by Baker;| he finds that for small values of <f> is given approximately 

by 

* - *™ 3 )£ f +0(£ 5 ). 

(80) 

Numerical solutions of equations (75) for larger values of £ have been 
obtained by Baker, and also by Bush and Caldwell,J making use of 
the differential analyser. It is found that there is only one solution 
of equation (75) which tends uniformly to zero as £ -> oo, the one 
corresponding to a value m __ 2*58856 

in equation (79). When m exceeds the critical value m x the curve cuts 

t E. B. Baker, Phys . Rev. 36, 630 (1930). 
t V. Bush and S. H. Caldwell, Phys. Rev. 38, 1898 (1931). 
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the £-axis at a point £ 0 (cf. Fig. 25); and when m < m x the curve 
decreases steadily to a minimum point (£ m , <f> m ), then increases steadily 
to infinity as £ approaches a certain value a, and finally decreases to 
zero as £ --> 0. Curves relating <£ m to are given in the paper by Baker. 



Asymptotic formulae for these solutions have been derived by 
Sommerfeld;f for m = m 1 he obtains for large values of £ 

( i | 3’886 144 

^~( 1 + (£/ 12 ») 0 - 772 j ~ |3 • 

For large values of £ this reduces to 

~ 144£- 3 , 


a solution which may be obtained by substituting 


* = At-* 

into equation (75) and equating powers of £ on both sides of the 
equation. Similarly when m <m ± it can be shown that 


(j) 

For neutral atoms we have 


400a 

(£-«) 4 


(f-*<*). 


lim z(r) = N — Z; 

r-Kc 

in other words, the curve <f>(£) tends to infinity in such a way that the 
tangent to it passes through the origin, since 

lim OB = 1 ——lim z(r) = 0. 

£-*<x> ^ r—>00 

The only curve which satisfies this condition is the curve A A' for which 


t A. Sommerfeld, Zeits.f. Physik, 78, 283 (1932); 80, 415 (1933). 
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ctyjd£ = — m x when f = 0. The curve is asymptotic to </>((•) = 0, i.e. 

O 0 = o, so that if <b is taken to be zero at infinity, O 0 must also be 
zero, which means that the maximum energy -~e<b 0 of an electron in 
a neutral atom is zero. 



0 10 40 

— r-> 


Fig. 26. Radial distribution of charge for Rb+. 


For positive ions the number of electrons N is less than the atomic 

number Z, so that *r 

limOB = l-~ > 0. 

£-*00 Z 

In this case we choose a curve with m > m 1 such that the tangent to 
the curve at the point P(f 0 ,0) where it cuts the axis (f> = 0 passes 
through the point P(0,1 —N/Z). The equation of the line BP is 

*>- ‘-DM)- 


which in the old coordinate system reduces to 


O-0, = e(Z-N)(i- i). 


and so represents the Coulomb field outside the ion. It follows at once 
that E 0 = — e<I> 0 < 0, 


showing that the maximum energy of the electrons in the ion is 
negative; the electrons cannot therefore escape from the ion. 

As an example! of the kind of results obtained by the Fermi-Thomas 
method, Fig. 26 shows the density of electrons in the heavy ion, 
rubidium + . The electron density is readily calculated from equation 
(69) once the wave function <f> defined by equation (74) has been found 
by numerical integration of the non-linear differential equation (76), 


t D. R. Hartree, Proc. Comb, Phil. Soc, 24 , 121 (1928). 
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The full curve shows the radial charge density calculated in this way 
and the dotted curve that calculated by the method of the self-consistent 
field (§ 28.2 above). It will be noted from this curve that the Thomas- 
Fermi method gives a good smoothed-out approximation to the distri¬ 
bution of charge constituting the self-consistent field. For this reason 
it is often useful to use the Thomas-Fermi method to obtain a first 
approximation to the field in the Hartree method. 

It would also appear that if the details of the field or charge distribu¬ 
tion are not considered to be important the Thomas-Fermi field or 
charge distribution may be sufficiently accurate. It should, however, 
be remembered that in this method the radial charge density for small 
r is proportional to r*, while in the wave-mechanical treatment it is 
proportional to r 2 . 


6072 
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VII 

INTERATOMIC FORCES 


31. Introduction 

All treatments based on quantum mechanics of the forces between 
atoms and molecules make use of a theorem due originally to Born and 
Oppenheimer.f This states that any problem in which atoms or 
molecules interact can be treated in the following way: 

First one supposes that the nuclei of the atoms concerned are at rest 
at points K v R#, N being the total number of nuclei in the 

system under consideration. One then obtains solutions of the SchrO- 
dinger equation for the electrons of the system moving in the field of 
these nuclei. These solutions will be written 0„(R; q)> where q are the 
coordinates, including spin, of the electrons, and W^(R) are the corre¬ 
sponding energy values. One then treats the motion of the nuclei by 
means of a second SchrOdinger equation, in which T^ t (R), together 
with the Coulomb interaction between the nuclei, plays the part of 
the potential energy function. If T*(R) is a solution of this equation, 
the wave function of the whole system is taken to be 

T-(RWJR,?). (l) 

This approximation is used in almost any problem involving inter¬ 
atomic forces, such as 

(a) The vibration and rotation of a molecule. 

(b) The (gas-kinetic) collision between molecules. 

(c) Calculations of the lattice energy or elastic constants of solids. 

The use of the wave function (1) is of course an approximation only; 
the neglected term in the wave function is of order (m/M)*, where m 
is the electronic mass and M the nuclear mass. For a proof, the reader 
is referred to the original paper. 

31.1. Rotation and vibration of diatomic molecules in their lowest 
electronic states 

The problem df calculating the energy levels has already been treated 
briefly in Chapter V. We start by assuming that the electronic problem 
is solved, and that the energy of the electronic system in its ground 
state is known for all separations R of the nuclei. Let U(R) denote 
the sum of this energy and the Coulomb interaction of the nuclei. 

t M. Bom and J. R. Oppenheimer, Ann. der Physik, 84, 457 (1927). 
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Then the SchrOdinger equation for nuclear motion is, according to the 
approximation introduced above, 

1 2 

As already shown (§ 23.2), if the centre of gravity is at rest, this equation 
reduces to * 2 

±-vy+ { W-U m = o, 


where ip is a function of R, the (vector) distance between the nuclei 
and M = Taking spherical polar coordinates, ip takes 

the form * = 

where/is the solution vanishing at the origin of 


r+~\w- u (R)- k(k ±^f=0. ( 2 ) 

The function U(R) will depend on the electronic ground state of the 
molecule under consideration, and will not in general be expressible in 
terms of simple functions. For stable molecules the energies of the 
lower states can usually be found by writing 


U(R) = U Q +\K{R-R,)\ 


which is a good approximation near the minimum of the energy curve. 
This method was adopted in Chapter VI. 

Another method is to make use of the function introduced by Morse ;f 
thi s is Z7(J?) = JP{1— 


The form of this function is qualitatively of the type that we expect 
for the force between atoms, except that it does not tend to infinity 
when R tends to zero; this is unlikely to be of importance. For the 
Morse function an exact solution of the SchrOdinger equation has been 
given for k = 0 and an approximate one for k ^ 0. The energy values 
are (if the minimum of the curve is taken to be zero) 

a^ 2 (^+i) 2 +J5A:(A:+l)+^>^ 2 (*+l) 2 —cx(^4-|)*t(A4-1), 


where 



x = 


4 F* 


B = 


Hir/r 


D 


h* 


128 


3 ( 


)' 


167T 2 Mr 2 0 D\ar 0 abl) 

It will be seen how, in this formula, the term hv(n-\- J) represents the 


t P. M. Morse, Phys. Rev . 34, 67 (1929); C. L. Pekeris, ibid. 45, 98 (1934). See also 
Pauling and Wilson, Introduction to Quantum Mechanics , p. 272. 
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vibrational levels, Bk(k-{-\) the rotational levels, and that the other 
terms are of smaller order of magnitude for small n , k . 

31.2. Review of types of interatomic force 

We must now give an outline of types of force between atoms and 
ions which give rise to the formation of molecules and solid bodies. 
Among these the simplest are the purely polar forces between ions. 
Thus a very satisfactory theory of the properties of crystalline alkali- 
halides (e.g. NaCl) has been built up on the assumption that these 
materials are formed from positively charged metal ions and negatively 
charged halide ions, which attract each other according to the inverse 
square law. An account of this theory is given in Chapter VIII. The 
introduction of quantum mechanics does not affect the calculation of 
electrostatic attraction between ions; it does, however, affect our ideas 
of the extent to which the forces other than the electrostatic attraction 
affect the cohesion. 

Secondly there are the so-called van der Waals forces between two 
neutral atoms or molecules. The van der Waals force may be defined 
as the force between two atoms or molecules when they are so far 
apart that their wave functions do not overlap. In general the potential 
energy of two molecules at a distance R apart, where R is great com¬ 
pared with molecular dimensions, can be expressed in the form 

-A/R*, 

where A is a constant. 

All other forces between atoms or molecules which are neither charged 
nor carry a permanent dipole arise only when they approach closely 
enough for there to be an appreciable overlap between the space covered 
by their respective wave functions. These ‘overlap’ forces are of three 
main kinds: 

(a) The repulsive forces which arise mainly between closed shells, 
such as rare gases and ions like Na+, Cl~, Ag + ; 

(b) The homopolar forces responsible for the bonds such as those 
between hydrogen atoms in H 2 or between carbon in diamond, 
graphite, or the molecules of organic chemistry; 

(c) Metallic forces; by these are meant the forces between the atoms 
of a metal or alloy in the solid or liquid state. 

32. Van der Waals’ forces 

32.1. Calculation of van der Waals 9 forces by classical methods 

We have seen in the previous chapters that an atom may be thought 
of as consisting of a small positively charged nucleus surrounded by an 
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electron cloud. For atoms which have spherical symmetry the centre 
of this electron distribution coincides with the nucleus. If, however, 
the atom is placed in a uniform electric field, the electron cloud is 
distorted in such a way that its centre is displaced away from the 
nucleus causing the atom to have an electric moment. The atom is 
then said to be polarized. It can be shown in a classical way that the 
van der Waals forces between neutral atoms can be interpreted in terms 
of these polarizations. We shall discuss this method before proceeding 
to the more accurate quantum mechanical calculations. 

If the strength F of the electric field is not too great we expect the 
dipole induced in the atom to be proportional to F. Denoting the 
magnitude of the dipole by p we may then write 

p = ocF , (3) 


where the constant oc is called the coefficient of polarizability. The 
energy W p of the dipole p in a constant field F is given by 

F 

W p = - jpdF=-ixF». (4) 

0 


If the field F is fluctuating about the value zero the mean of the 
polarization energy W p remains finite, 

% - -UF\ (r>) 


since the mean of the square of the field F 2 will not be zero. 

Let us now consider the field F due to n dipoles of strengths p n placed 
at a groat distance r from the atom and inclined at angles 0 n to the 
vector r. Then if r is sufficiently great the field will be uniform, and 
have components 


K = 


i 


COS 6 n 


*-2 


Fn 


sin d.„ 


n 


n 


A neutral atom in which electrons move in the field of a nucleus may 
be represented by an ensemble of fluctuating dipoles of this type; wo 
may thus write 

H'n es n> 

where s n is the distance of the nth electron from the nucleus. For such 
an arrangement we have 


** = js 4 (2 COS + (| s n sin 0 n ) 2 


r 8 ’ 


j8 being interpreted as a kind of mean square of the dipole moment of 
an atom. Substituting from equation (6) into equation (5) we find that 
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the effect of this atom on another distant atom is to change the energy 
by an amount W p given by 

W p = —ocplr*. ( 7 ) 

The interaction energy of two atoms with atomic polarizabilities ol v ol 2 
and mean square dipole moments fi v /? 2 * s thus 

“i Pt + VjP 1 
r 9 


W 


( 8 ) 


it being assumed that the effects are additive. 

This classical method of calculation shows the characteristic features 
of van der Waals’ forces. The negative sign in the expression (8) for 
the interaction energy implies that the two atoms attract one another. 
The force between the two atoms varies as the inverse seventh power 
of the distance; we shall see that this conclusion is confirmed by the 
use of the methods of wave mechanics. It is found, however, that the 
multiplicative constant a 1 j8 2 +a 2 j8 1 given by the classical theory is about 
five times too great if the value of jS is deduced from measurements 
of the magnetic susceptibility, f 


32.2. Polarizability of an atom in an electric field 
The considerations of the last section show that in order to obtain 
the van der Waals attraction between atoms it is necessary to calcu¬ 
late the dipole induced on an atom placed in a static electric field, and 
thus the polarizability of the atom. 

The calculation will first be carried through for an atom with one 
electron in a Coulomb field. The coordinates of the electron are denoted 
by (x,y,z). Then if an electric field F is introduced along the z-axis 
the perturbing term introduced into the Hamiltonian, in other words 
the potential energy of the electron in this field, is 



H' = -Fez. 

( 9 ) 

The SchrOdinger equation then becomes 



Vy+^W+j-eFrcosOy = 0. 

(10) 

If we take 

</j 0 = e- r '“, W 0 = —e 2 /2a 


and write 

t = M 1 —Fcosd.B(r)}, 

(11) 


then W will differ from W 0 by a term of order F 2 . Thus substituting 
from (11) into (10), making use of the equation 

VV„+J?(ni+^„=°, 

t P. Debye, Phys. Zeits. 21 , 178 (1920); 22 , 302 (1921). 


(12a) 
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and neglecting terms of order F 2 we obtain 
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VY+ : 




\<f> + -f^eFr cos 0*p o ~ 0. 


The ordinary differential equation for R(r) then becomes 

d 2 R J1 l\dR 2R == _2r 
dr 2 \r a) dr r 2 ~~ ea 9 

which has the particular integral 




To determine the complementary function we let 

R(r) = ^ a n rP +n 
0 

and equate powers of p—2 to obtain the indicial equation 

(p-l)(p+2) = 0. 


( 126 ) 

(13) 

(14) 


The root p = — 2 must be rejected since it would not lead to a function 
R which tends to zero with r. Putting p ~ 1 and equating coefficients 
of r 71 - 1 we obtain the recurrence relation 


Q'n = 2 n+1 
a n - 1 an{n+2y 

which shows that, as n -> oo, 

a n 2 

- ~ — , 

a n-l an 

so that, for large values of r, 

R(r) ~ e 2r ‘ a . 

Any wave function </> containing this complementary function violates 
the boundary conditions; <f> is therefore given by equations (11), (14). 
Hence to the first order we have 


^_e-*j 1 -^' c os«(r + li)). (15) 

Making use of the form (11) for the wave function tfj, and of the equa¬ 
tions (12 a) and (126) we have 

Htff = W q (i/j 0 —</>)—eFr</> cos 9 
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as far as powers of order F 2 . Thus neglecting powers of order F 3 we 
have * 


W = W 0 - 


J K4> ri 

J 00^0 


cos 9 dr 


= K-irvj 

0 0 ' 0 
The integrations are elementary and give finally 

But in terms of the polarizability a we have 

W = JF 0 -4<*F 2 , 

so that, since the two formulae are equivalent, 

a = ga 3 . ( 16 ) 

This corresponds to a value 0-677.10~ 24 cm . 3 for the polarizability of 
the normal hydrogen atom. 

An alternative treatment of the polarizability, valid for any field, 
is as follows: the considerations of Chapter 111 show that the perturbed 
wave function is _. v / 

n 71 0 

where i/j 0 (x,y,z) is the wave function of the ground state, i/j n (x,y y z) of 
any excited state, and 

Z 0 n = J ( 17 ) 

The charge density in the perturbed atom, neglecting terms of order 
F‘ 2 , is thus 

p(x,y,z) = R|«A 0 | a +e 2 i ’2 |^^ 0 »^o+complex conjugatej. (18) 

The second term is not in general spherically symmetrical and will thus 
give rise to a dipole moment. Thus, for instance, if t/r 0 corresponds to 
an 5-state and ip n to a 43 -state, the product </r 0 will be of the form 
zf(r), and will be as illustrated in curve (3) in Fig. 27. The change in 
the charge density of the atom is also illustrated. 

The dipole moment of the atom is equal to 


J zp(x,y,z)dr, 
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which, on substituting from (18) gives for the polarizability (dipole 
divided by field) „ |2 

~ I oJ (19) 


: 2e 2 


-2 




-w 0 - 


The quantities |Z 0n | 2 , here introduced for the first time, have con¬ 
siderable importance for the theory. 

It is convenient to introduce the 
dimensionless quantity f n defined by 
2m, 


fn 


ft 2 


(W n -W 0 )\Z 0 


The quantity f n is called the ‘oscillator 
strength ’ of the transition 0 -> n of 
the atom or molecule concerned. The 
reason for this is obvious. If we write 
v n -- (W n —WQ)lh and substitute in (19), 
we find for the polarizability a, 


a — 


e 2 V/. 

4:TT 2 m V , 2 ’ 



( 20 ) 


Ki«. 27. Charge density in an atom. 
(1)-unperturbed; (2) . per¬ 

turbed ; (3) ^£0o- 


Now a ‘classical’ electron bound to a fixed point with a restoring force 
such that it would vibrate with frequency v would give a polarizability 
e^jArrhfnv 1 . The real atom, therefore, behaves as though it were a 
collection of oscillators of frequency v n and strength f n . 

In Chapter X it will be shown that the quantities f n determine the 
intensities of the corresponding absorption lines. 

In the next section it will be shown that 


Ifn = 1 
n 

for a one-electron atom, or for a series of transitions in which only one 
electron takes part. Thus if, as is frequently the case, one absorption 
line v is much stronger than all the others, the polarizability a is given 
approximately by f 2 

c V n+m/ - 

477 -hnv 2 

as on the classical theory. 

The proof that for the hydrogen atom formulae (16) and (19) are 
equivalent is left to the reader. 


32.3. The f-sum rule 

We have in this section to prove that 

Ifn = I- 
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As a preliminary we show that 

J K&dr = ™(W n -W„) J V-*#„ dr. (21) 

This may be proved in an elementary way from SchrOdingers equation. 

Thus 2m 

V%+^(JF o -F)0„ = 0, 

and vy* + ^(W n -V)tf>* = 0. 

Multiplying the first equation by </r*z, the second by ^r 0 z, integrating 
over all space and subtracting, we obtain 

J (ftzVVb= J 

Since <A*zVYo = 

we have, making use of Green’s theorem, the required result (21). 

It follows that 

2/» - -2 2 J J ^-fjdr, 

n 

and, by the argument given on p. 95, the right-hand side is 

It follows from partial integration that this is equal to J and 

thus to unity. 


32.4. Calculation of van der Waals force based on wave mechanics 
As already stated, the term van der Waals’ force is used in this book 
to describe the force between two atoms or molecules which are far 
enough apart for the overlap between the wave functions to be 
neglected. 

We shall calculate the attraction between two hydrogen or one- 
electron atoms, a and 6, at a distance R from each other. The atoms 
are not necessarily of the same type; we denote by u 0 (r), u n (r) the wave 
functions of an electron in the normal and excited states in atom a , 
and by t; 0 (r), v n (r) the wave functions for an electron in atom 6. Since 
the overlap is neglected, it is not necessary to use antisymmetrical 
wave functions; we may locate electron 1 in atom a and electron 2 in 
atom 6, and take for the zero-order wave function of the system 

w o( f i) v o( r a)' 
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The interaction energy F(r x , r 2 ) between the two atoms is then 
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.p_ i 

U |r„- 


where r tt , r b denote the positions of the nuclei of the two atoms. 

If we use the perturbation theory of Chapter III to work out the 
change of the energy of the system of the two atoms due to F, the 
answer will obviously be zero; the integral 


| K(r 1 ,r t )|« # (r 1 )|*rfr 1 


is the electrostatic potential of an electron r 2 outside the unperturbed 
atom a. This will be zero since the unperturbed atom is spherically 
symmetrical. We have to carry the perturbation theory to a higher 
order of accuracy and calculate the energy using perturbed wave 
functions. 

By equation (10) of Chapter III the perturbed wave function is 

/- \ t. (n,n |V10,0) , , , . /eye> \ 


«o(r,KW+ T {22) 

Here W 0 , W n and W' {) , W n denote the energies of the normal and excited 
states of the two atoms and 

(n,n'\V\0,0 ) = JJ (^)^(ri,r 2 )M 0 (r,^ 0 (r 2 ) d^dT^ 

This means that owing to the interaction energy either atom may be 
perturbed and carry a dipole; but the position of the electron in one 
atom depends on the position of the electron in the other. The average 
dipole moment of either atom is zero, although the average interaction 
energy of the dipoles on each is not. 

If we use (22) to calculate the mean value of the interaction energy 
we obtain v |( M , W '|F|0,0)| 2 

Zw n +w' n .-w 0 -wV 

To calculate (n, v! | F10,0), we note that it simply represents the electro¬ 
static potential energy of two electrostatic charge densities 

eu*u 0 and ev% v 0 

in one another’s field. If these have dipole moments, the energy is, so 
long as It is great compared with the atomic diameter, 


^On'^yOn 2/on'}* 
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z is here taken to be along the line joining the two nuclei, and 

z o« = J u*zu 0 dr, z 0n . = J v*zv 0 dr. 

It is possible to choose the states n so that only one of z 0n , x Qn , y 0n does 
not vanish for a given state. Thus the interaction energy is 

£ ST *4n z'ol'+ x on Vgn; ±yLy & 

W 0 +W' 0 -W n -W' n . ’ 

and by symmetry this can be written 

&L 4 V _ _ (o-i) 


As in the last section, if for each of the atoms considered one transi¬ 
tion, of frequency v, v respectively, has an 'oscillator strength’ large 
compared with the others, the interaction energy between the two atoms 


may be written 



1 l_ 

jR 6 


(24) 


This term, therefore, will give the interaction energy between two 
molecules or atoms having strong absorption lines of frequencies v, v, 
all other lines being weak. 

The theory given here of the van der Waals attraction between 
molecules is due originally to F. London and R. Eisenschitz.f When 
the summations are carried out it is found that the van der Waals 
interaction energy between two hydrogen atoms is 
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volts, 


(25) 


where a is the atomic unit of length (0*53 A). 

The same result has been arrived at by Lennard-Jones J by the use 
of his generalized form of the Schrodinger perturbation theory. Denot¬ 
ing the two electrons by 1 and 2 and their distances from one nucleus 
by r and those from the other by p, the Hamiltonian of the perturbation 


term is 


\Pi r z r u R) 


If the distance between the two nuclei, 22, is large and it is assumed 
that the electrons are never very far from their respective nuclei so 


t F. London and R. Eisensehitz, Zcits.f. Physik , 60 , 491 (1930). Soe also the account 
given by H. Bethe, Handh. d. Physik , 241 / 1 , 547 (1933). 

J J. E. Leonard-Jones, Proc. Roy. Soc. A, 129 , 598 (1929). (§ 18.4 above.) 
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that r v p 2 are small in comparison with r 2 , p v and R, then by expanding 
the expression (26) for ll x in the form 

H i = — co8(r 1; R)cos(p 2 , RJ-cosfo, 

and applying his perturbation theory Leniiard-Jones derived the 
formula (25). 

Although the calculation of the long-range forces between hydrogen 
atoms is of great theoretical interest it is of little practical importance. 
The practically important case of helium atoms has been considered 
by Baber and Hassef by a method involving various refinements of 
the London-Eisenschitz method. Calculations on the van der Waals 
interaction between heavier atoms have been performed^ and are found 
to lead to results in reasonably close agreement with the experimental 
observations. 

The calculation of the van der Waals force between two hydrogen 
molecules lias been carried out by Massey and Buckingham^ using a 
variational method. Particular interest is attached to this calculation 
because of the fact that the hydrogen molecule being axially sym¬ 
metrical possesses a permanent quadripole electric moment. Part of 
the force between two hydrogen molecules is therefore made up of the 
static interaction between these quadripole moments. It is found that 
the interaction energy in this case is 

T7 _G(«,jM) F(a,p 9 0) 

V R* ~ R« 9 ( ' 

where the angles a, j3, 6 determine the fixed orientations of the two 
symmetric diatomic molecules. Of the two terms the second is the 
dynamical dipole-dipole interaction of the type which gives rise to the 
van der Waals force between two hydrogen atoms in 5-states, and 
the first the quadripole interaction referred to above. It is found that 
whereas the average of F over all orientations is finite, that of G 
vanishes, so that the static force is not of predominant importance in 
gas-kinetic phenomena. 

The interaction of the dipole system of one atom with the quadripole 
system of another has been considered by various authors. || It is found 

t T. D. H. Baber and H. R. Hass6, Proc. Camb. Phil . Soc. 33, 253 (1937). 

t J. G. Kirkwood, Phys. Zeits. 33, 57 (1932); H. Hellmann, Acta Physicochim. 
USSR. 2, 273 (1935); R. A. Buckingham, Proc . Roy. Soc. A, 160, 94 (1937). 

§ H. S. W. Massey and R. A. Buckingham, Proc. Roy. Irish Acad . 45, 31 (1938). 

(II H. Margenau, Phys. Rev. 38, 747 (1931); R. A. Buckingham, loc. cit.; J. E. Mayer, 
J. Chem. Phys. 1, 270 (1933). 
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that this interaction gives rise to a term proportional to r~ 8 which is 
appreciable for moderate distances; for instance, Mayer calculates that 
the dipole-quadripole interaction amounts to 20 per cent, of the dipole- 
dipole interaction in the energy of certain crystals (NaCl, KC1, KI). 


32.5. The calculation of the van der Waals constants from observed 
polarizabilities 

In the investigations of Kirkwood, Hellmann, and Buckingham, cited 
above, on the van der Waals interaction between heavy atoms, a great 
deal of attention was devoted to attempts to relate the constant c of 
the van der Waals energy (— c/r 6 ) to the coefficient of polarizability. 
The previous calculations were improved by Buckingham,! who made 
use of wave functions calculated by the method of self-consistent fields, 
and included the effect of electron exchange. 

For an atom with N electrons in a single shell, Buckingham found 
that the polarizability of the atom can be written in the form 


9 N 


(H*)W, 


(28) 


where a is the radius of the first Bohr orbit and R 2 is a certain mean 
of the square of the radius of the outer orbits measured in atomic units. 
Introducing a mean radius of the outer orbits in centimetres by means 
of the equation 


we can write equation (28) in the form 

^4 No* 
9a* 


(29) 


The constant, c, of the van der Waals field between two unequal atoms 
(2^ and N 2 ) is found by Buckingham to be related to the cr, thus defined, 
by the equation 

“ — (30) 


4 / Nj N 2 e 2 \ o\oj 
9\ a /erf-fag’ 


or, in terms of polarizabilities 
c = | ah 2 


<*i ot 2 




(31) 


The formula (31) is readily generalized in the case in which there are 
several shells each with a characteristic size and polarizability of the 


f R. A. Buckingham, loc. cit. 
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(n v i x ) sub-group of electrons in the atom 1 by oi(n v l x ) and the number 
of atoms in it by v(n v l 1 ) i then formula (31) is replaced by 

c _ 3 a l e 2 _ <x( / n'vh) <x ( n 2>h) _ / qo \ 

Z* Ji)}+V{ a (^2> h)l v ( n 2’h)} 

where the summation extends over all pairs of sub-groups. 

By means of this formula the van der Waals force constants can be 
calculated from observed polarizabilities. A series of values of e was 
calculated by Buckingham and is shown in Table 4 below. 


Table 4 



c 


c 


c 

Interaction 

.. .. 

(10 ~' ao ergs cm. 6 ) 

Interaction 

( 10 ~ 60 m /* cm. 6 ) 

Interaction 

(10 ~ 60 crgfi cm*) 

He—He 

1*63 

A—A 

63-5 

N» + —Na + 

3-70 

He—Ne 

3-48 

A—Kr 

92-7 

Na*'—K + 

10-4 

He—A 

9-89 

A—Xe 

135-5 

Na + —Rb 4 

15-1 

He—Kr 

14-4 



Na + — Cs f 

21-6 

He—Xe 

20-7 

Xe—Xe 

293 

K<—K+ 

33-3 

Ne—Ne 

7-48 

Li *— Li^ 

0-097 

K+—Rb* 

48-4 

Ne—A 

20-5 

Li+—Na+ 

0-584 

KCs-> 

71-3 

No—Kr 

300 

Li+—K+ 

1-51 



Ne—Xe 

42-6 

Li+—Rb f 

2-21 

Rb f —Rb + 

70-5 



Li*—Cs + 

314 

Rb + —Cs + 

104 

Kr—Kr 

136 


' * 



Kr—Xe 

199 

•• 


Cs+—Cs 

155 


33. Overlap forces 

In the last sections we have considered the van der Waals forces 
between atoms, which arise when the overlap between the wave func¬ 
tions is negligible. All other forces that occur between neutral atoms 
may be classed ‘overlap forces’ and arise only when the wave function 
of one atom overlaps that of the other. Among overlap forces can be 
included: 

(a) The valence forces of chemistry. 

(b) The forces of repulsion which always set in when atoms or ions 
with a closed shell configuration approach closely. 

Both types of force may be calculated by the same methods; two 
methods, in particular, are available. These are: 

(i) The method of London-Heitler. 

(ii) The method of molecular orbitals. 

Neither method will, of course, give an exact answer, but both can be 
treated as approximate methods from which a start can be made to 
build up more accurate wave functions. 
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In either method, the Hamiltonian H of the electrons in the molecule 
must first be written down; an approximate wave-function \P must then 
be found, and the energy of the molecule calculated from the formula 

W = J '¥*H'¥ dr. (33) 

The methods differ in the form taken by the approximate wave 
functions. 

The difference can be seen most simply in the case of a diatomic 
molecule, and the hydrogen molecule H 2 will be considered first. In 
the method of Heitler and London wave functions are set up for an 
electron in one atom in the absence of the other atom. Thus in hydrogen 
let 0 a (r) denote the wave function for an electron in one of the atoms 
of the molecule in the absence of the oilier, and ip b (r) the wave function 
of an electron in the other. In the original paper by Heitler and 
Londont on the hydrogen molecule H 2 , i/j u and i/j 0 were taken to be 
wave functions for electrons in the ground states; thus if the two nuclei 
are located at points R a , R h and r a denotes the vector r--R a and r b 
the vector r— R b , then 

i/j a (r) =n ce- r «/«", if, h (r) = ce~ r */ a ", 

where c is the normalizing constant. In other cases, in particular in 
considering the carbon bonds, the wave functions will not necessarily 
refer to the normal state, since a lower value of W, and hence a better 
approximation, can be obtained otherwise. 

We then set up our approximate wave-function X V; it must have the 
required antisymmetry in the coordinates of the two electrons, and will 
thus have one of the two forms (according to the arguments of Chap. V) 

Y = (34a) 

Y = B[U*i)M r,)-^,^)]^,*,). (346) 

Here Xa( s v 8 2 ) i 0 an antisymmetrical wave function in the coordinates 
of the two spins, giving therefore a singlet for each set of orbital wave- 
functions tff a , \ft b when the energy W is calculated from (33); 
is symmetrical and gives a triplet series of terms. A, B are normalizing 
factors. 

If the energy W of the molecule is now calculated according to (33) 
using the wave functions (34a), (346) respectively, we find 

W = -4 2 {(2+J} in case (34 a) 

== B 2 {Q—J} in case (346), 


t W. Heitler and F. London, Zeits.f. Physik, 44, 456 (1927). 
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Q = // 'f , a( r i)'Pb( r 2)H'l'a( r i)'l>b( r 2) dr l dr 2 

^ = // <f‘a( r i) ) l J b( r 2)H'Pa( r 2)<l'b( r i) dT r d,T 2 . 

The integral J is called the ‘exchange integral’; for hydrogen it is 
negative, and varies with distance more rapidly than Q. It thus leads 
to attraction and to the formation of a stable molecule for the singlet 
state (34a) but not for the triplet states (346). 

In the method of molecular orbitals,f one starts with wave functions 
for the individual electron in which each electron is supposed to be 
shared equally between the two atoms. Wave functions can thus be 

86t UP *«(r) = Ur)+h(r) (35) 

or ^(r) = i/i a (r)—il/ b (r), (36) 

where ip n and ip b are as defined above. Alternatively wave functions 
may be obtained which are a better approximation to the problem of 
an electron moving in some sort of self-consistent field for the whole 
molecule. One then builds up the molecule, putting not more than two 
electrons into each state corresponding to the wave-functions i/j a > 
iftp, etc. 

The method of molecular orbitals is in general very suitable for 
handling the excited states of molecules and for the discussion of com¬ 
plex molecules; for the discussion of metals it is at present the only 
practicable method, the London-Heitler approximation proving too 
difficult for the mathematical methods now available (cf. Chap. VIII). 
It is probably not so suitable as the starting-point of detailed numerical 
work. 

The reason for this may be seen by setting up the wave functions 
for the lowest singlet and triplet states in hydrogen; these are, using 
the method of orbitals, 

</ , a( r i)^a( r 2)X^( a l> S 2) ( 37 ) 

and [^ a (ri)«A J 8(r 2 )-^ a (r 2 )^(r 1 )] Xs (s 1> s 2 ). (38) 

The product ^ a (ri)^ a (r 2 ) ma y written 

This differs from the wave function (34 a) in including the terms 
^ a (r 1 )^ 0 (r 2 ), tf/ b (r 1 )if/ b (r 2 ) which represent both electrons in the same 
atom. The method of molecular orbitals thus allows electrons with 

f The name is due to J. E. Lennard-Jones {Trans. Faraday Soc. 25, 068 (1929)); he 
and F. Hund {Zeits.f. Physik , 51, 759 (1928), ibid. 63, 719 (1930)) first introduced it. 
It has been extensively developed by R. A. Milliken {Rev. Mod. Phys. 4, 19 (1932)) and 
by Coulson (refs. pp. 189, 192). 

&072 
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opposite spins to come into the same atom, and when the energy is 
calculated a large term will arise from the interaction between them. 
This term is certainly greatly over-estimated in the orbital method; the 
best approximation to the wave function would be of the type 

Hi'a( T lW a ( r 2)+M r l)M r 2)}+ta( r l)fa( r 2)+ta( r 2)M r l)’ 

with k between zero and unity. 

On the other hand, the wave function (38) for the triplet states 
reduces exactly to the same form as in the London-Heitler approxima¬ 
tion (346). In the orbital model, then, the symmetry of the wave 
function prevents two electrons with the same spin from coming into 
the same atom. 


\~33.1. Detailed discussion of the hydrogen molecule by the London - 
fteitier method f 

In this section a discussion is given of the hydrogen molecule in its 
lowest electronic state treated according to the approximation of 
Heitler and London. The discussion of the last paragraph shows that 
in its lowest state the spins of the two electrons have asymmetrical 
wave function and must therefore be antiparallel, that the ground state 
is therefore a singlet (spin moment zero), and that the energy of the 
molecule is given by formula (33). In this section we have, therefore, 
only to set down the Hamiltonian and work out the normalizing factors 
A, B and the integrals Q, J. 

The normalizing factor will be discussed first. The wave functions 
^ a (r), ift ft (r) will be taken as normalized to unity. Then it is easily seen 

that A 2 - ]/(2+2A 2 ), IP - I/(2—2A 2 ), 


where 


A = / Ur)Ur)dr. 


We have thus to calculate Q, J 9 and A. 
The Hamiltonian for the system is 


-£< v *+v»- 


e 

Hi Hi r, 


— r—r—r—F“+ 


a2 


Hi 


'12 


R, 


(39) 


(40) 


ab 


Here r nl denotes the distance between the nucleus a and the electron 
with coordinate r x ; r 12 is the distance between the two electrons, and 
R ah the distance between the nuclei. The term in e 2 /R ab ensures that 
W will give the total energy of the molecule, including the repulsion 
between the nuclei. 


f For further details see, for example, Handbueh der Phyaik , vol. 24/1, 535 (1933); 
Pauling and Wilson, Introduction to Quantum Mechanics , p. 340 (Cambridge, Mass., 
1935). 
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The simplest form to take for the wave-functions i/f n (r), i/t h (r) is that 
of the unperturbed hydrogen atom; this was the course adopted in the 
original paper of London and Heitler. With 
these wave functions the integral can be 
evaluated to give for the energies W s , W A of 
the symmetrical and antisymmetrieal states 

H4-2b;, + £ + M., ( 4.) 


W __ OW I e2 ] ^11 ^12 

Wa - Wh+ TQ + T=a*~ 



(42; 


Fig. 28. Coordinates used for 
the hydrogen molecule. 


(43) 


where W H is the energy (—me 4 / 2fi 2 ) of the 

ground state of the hydrogen atom, A is defined by equation (39) and 

hence is , » 

A = -- e -(r<n-\ni)l a t J T 

rra 3 J 

and W u and W X2 denote the integrals 

W n = e 2 f f f-(r ax mrjy - -- - r -) dr, dr 2 

J J V 12 r a2 W 

— —- f f e -i-L\ dr ( 44 ) 

«* 3 JJ Vu r a2 r bll 

w n = |e 2 f f Mr a iWr a 2 Wr bl )t(rJl-- L_ ±_l)rf Tl rf T8 

J J v 12 r «l r «2 r bl r bV 

= - — - f f I r62) / rt (—— — -— - dr 1 dr 2 . (45) 

2^ 3 jj o 12 r fll r a2 r hl rj 1 

The problem is thus reduced to the evaluation of the integrals A, W 1V 

w 12 . 

To evaluate the integral A we introduce confoeal elliptic coordinates 
rj defined by the relations 

e = r -4-> v = ( 46 ) 

M ab n ab 

and the azimuthal angle (f>. The variables £, 77 so defined lie in the ranges 
1 < f ^ oo, —1 < 77 ^ 1 and the volume element becomes 

dr = lI&(P-7f)d{drM. (47) 

Substituting from equations (46) and (47) into (43) we obtain 
00 1 

A = §J er**6'di J (f*- v *)d v = (l +“ + §) e-/W “ ( 48 ) 

1-1 
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In the expression (44) for the integral W u the first term may be evaluated 
by the use of the elliptic coordinates £ and 77 , the second and third 
directly. The final result is 

1 Bh\ 

6 a 2 




(49) 


The integrals involved in the evaluation of W 12 are cumbersome; indeed, 
that involving l/r 12 presents some difficulty. Heitler and London 
developed an approximation for W 12 , its value was later derived by 
Sugiura,f who found the expression 

soft iia 

7.2 > ~ 


w v 


I W ' ft ' 


+ 


w.!U 

*r,A v 


\ 8 1 20 
R„ 


15 a 3 


I 1 M 

15 a 3 j 


+ 


y -flog + A'*E i | 
lation (48), 

■ (■ * 


-2AA'EiJ- 2 ^JJ, 


(50) 


where A is given by equation (48), A' by the equation 

R%\ 


+ 


3 a 3 ) ’ 


y is Euler’s constant, and Ei denotes the integral logarithm 

z 



Ei(Z) = 


/? 


dt (Z < 0 ). 


Fig. 29. Energies of the symmet¬ 
rical and anti symmetrical states of 
H t as a function of the distance 
between the atoms. 


Substituting from equations (48), (49), 
and (50) into equations (41) and (42) we 
obtain expressions for the energies W s 
and W A as functions of R ab . The varia¬ 
tion of these functions with R ab is 
shown in Fig. 29; the function W A de¬ 
creases monotonieally while has a 
minimum at R ab /a =1*518. W A corre¬ 
sponds then to repulsion at all distances 
so that a stable molecule is not formed. 
On the other hand, PFg corresponds to 
the attraction of the two hydrogen atoms with the formation of a stable 
molecule. The equilibrium value of the relative position of the nuclei 

is given by = l-518a = 0-80 A, 

which is in excess of the known experimental value 0*7395 A. The 
corresponding theoretical value of the heat of dissociation is 3*14 eV 


t Y. Sugiura, Zeits.f. Physik, 45, 484 (1927). 
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in contrast to the observed value 4*48 eV. The vibrational frequency 
of the nuclei, determined from the curvature of the function in the 
neighbourhood of its minimum, comes out as 4,800 cm.- 1 ; the value 
obtained from observations of band spectra is 4,405*3 cm.” 1 


33.2. Improvements in the approximation of London and IIeider 

A simple improvement is to use for tf/ a (r), *fj h {r ) not the wave functions 
for an electron in an unperturbed hydrogen atom but the wave functions 
for an electron moving in the field of a charge z , which will then be 
chosen to minimize the energy for each value of R ah . This was first 
done by Wang,f who found a valuo of z , at the interatomic spacing 
giving minimum energy, given by z = 1*106. 

The addition to the London Heitler wave functions f ^Fpolar wave 
functions corresponding to electron configurations H+H” and H~H+ has 
also been considered by Weinbaum. J Weinbaum takes a wave function 
of the type (38 a) discussed above, with ip(r) as before the Is wave 
function for an electron moving in a charge ze . He finds z = 1*193 and 
Jc = 0*256 for the values of the parameters giving minimum energy. 
The fact that k is less than \ shows that the London-Hoitler is a better 
approximation than that of molecular orbitals, which gives k = 1, for 
the hydrogen molecule. 

Other improvements in the wave functions are: 

(a) That of Rosen,§ who takes for 0(r) a mixture of 6* and p functions 
cf the type 

He finds a — 0*10. 

(b) Further work by Weinbaum (loc. cit.), who combines this refine¬ 
ment with that of polar states. He finds 

= 1*19, a = 0*07, k = 0*176. 


Table 5 


Heitler-London .... 
Wang. ..... 
Weinbaum (ionic terms) 

Rosen (addition of p terms) 
Weinbaum (ionic terms and p terms) 
James and Coolidge 
Experiment .... 


Heat of 

dissociation (eV.) 

^ab 

(A) 

Vibrational 
frequency (cm. l ) 

314 

0*80 

4800 

3*76 

0*76 

4900 

4*00 

0*77 

4750 

4*02 

0*77 

4200 

4*10 



4*722 

0*74 

.. 

4*477 

0*7395 

4405*3 


t S. C. Wang, Phys . Rev. 31, 579 (1928). 
t S. Weinbaum, J. Chem. Phys. 1, 593 (1933). 
§ N. Rosen, Phys. Rev. 38, 2099 (1931). 
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(c) A much more complicated wave function set up by James and 
Coolidge,f in which terms in r 12 are included. 

The results of these calculations are given in Table 54 
For further details the reader is referred to the original papers, or 
the accounts by Pauling and Wilson (loc. cit.). 

33.3. Some other applications of the London-Heitler method 
Repulsive forces between atoms and ions with the closed shell configura¬ 
tion . Only for helium and neon have detailed calculations of the inter¬ 
action energy between two atoms with the closed shell configuration 
been made. The overlap energy is found to be repulsive at all distances. 
The reason for this may be seen qualitatively as follows: each electron 
in one helium atom will interact with two electrons in the other, one 
of which has spin parallel to its own and one which has spin antiparallel. 
As regards the interaction between the electrons with parallel spins, 
we know that the spin wave function must be symmetrical and the 
orbital function therefore anti symmetrical. The contribution to the 
energy of the molecule is thus of the form 

Q+J- 

As regards the interaction between electrons with antiparallel spins, the 
spin wave function is as likely to be symmetrical as an Asymmetrical, 
and the contribution to the energy of the molecule is thus of the form 

The total energy of the pair of interacting atoms is thus 

2 Q+ J. 

The exchange integral J being negative, repulsion between the mole¬ 
cules results at all distances. 

Detailed computations of the interaction between two helium atoms 
along these lines have been made by Gentile§ and Slater. || Slater 
obtains a complicated expression for the interaction energy that can 
be approximated closely by 

1]2 

7-7 X 10- 10 e- 2 ' 43 ^o, a 0 = —- = 0-529 A. 

me 2 

This expression, added to a term giving van der Waals’ attraction, has 

t H.M. James and A. S. Coolidge,«/. Chem. Phys. 1, 825 (1933). 
j From Pauling and Wilson, The Structure of Band Spectra. 

§ G. Gentile, Zeits. /. Physik , 63, 795 (1930). 

|| J. C. Slater, Phys. Rev . 32, 349 (1928). 
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been extensively used for the calculation of the equation of state of 
helium gas.f An expression for the complete interaction energy given 
by Slater and Kirkwood is 

7 * 7 £- 2 * 43 / 2/00 — 7 ~": - J lO“ 10 ergs. 

(i?/a 0 ) 6 J * 

This curve gives a shallow minimum of depth 
10 -15 ergs (6x 10~ 4 eV) at R czt 2*9 A. It is 
not certain whether the ‘potential hole* is 
deep and wide enough to permit the forma¬ 
tion of a stable molecule. 

Computations have also been carried out 
for neonj using the same method. For the 
repulsive potential is found 

1-18X10 4 e~W' 395 eV, 
where R is the distance between the atoms in atomic units. 

33.4. The hydrogen molecule by the method of molecular orbitals 

The elements of the method of molecular orbitals have been discussed 
above; the wave functions of the individual electrons are taken to be 
the solutions of some Schrodinger equation 

W a +^(W-V)+ a = 0, 

where V is some field which is the same for all the electrons. The 
essential points about the approximation are thus: 

(a) that each electron is thought of as being shared between all the 
atoms of the molecule, its position being uncorrelated with the 
positions of the others (at any rate those of opposite spin); 

(b) the wave functions for the individual electrons are orthogonal 
to each other. 

To illustrate the method we shall consider the ground state of the 
hydrogen molecule.§ Suppose we denote by </>(!) the molecular orbit 
for electron 1 in the field of the two nuclei A and B , then in the 
molecular orbital treatment we write for the wave function of the 
hydrogen molecule # = * W( 2 )x>1 ,(61) 

f Cf. W. H. Koesom, Helium (Amsterdam, 3 942); J. de Boer and A. Michels, Phymca, 
6, 409 (1939); H. S. W. Massey and R. A. Buckingham, Proc. Hoy. Soc. A 168, 378 
(1938); E. Beth and G. E. Uhlenbeck, Physica, 4, 915 (1937); J. C. Slater and J. G. 
Kirkwood, Phys. Rev. 37, 682 (1931). 

t W. E. Bleiok and J. Mayer, J. Chem. Phyn. 2, 252 (1934). 

§ C. A. Coulson, Tram. Faraday Soo. 33, 1479 (1937). 
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where x^( a i>* 2 ) denotes the spin wave function which is usually omitted 
since it makes no further contribution either to the wave function or 
to the energy. The figures 1 and 2 stand for r x and r 2 . 

The molecular orbital method therefore depends on the solution of 
the wave equation for a single electron moving in the field of two fixed 
protons. The solution of the problem of the hydrogen molecular ion 
(H,+ ) is thus fundamental to the discussion of molecular structure by 
the method of molecular orbitals and must be investigated first. 

The Hamiltonian of the system may be written 

■lj 2 p2 p2 p 2 

J1)\ T a Tjj -^ab 

To solve the Schrodinger equation 

H<f> = W4> (52) 

we transform to the elliptic coordinates 77 , by equations (46) with 
azimuthal angle x • I * 1 these coordinates 

1 1 1 4 £ 

r„ H 

Introducing the radius a 0 of the lowest orbit of the Bohr atom 
(a 0 = h 2 lme 2 ), transforming the Laplacian to these coordinates, and 
writing « 

W = -t-(l+w), 

K db 

we find that equation (52) reduces to 


d_ 




+ 2 , 


= 0 . 


This equation is separable; writing 

- cf> = x(mv)®(x)> 

we find that the functions X , Y, and 3> satisfy the ordinary differential 
equations 


d x 


2 + A^-O 


d 

d 


r^-.+H r = 0 ' 
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where A, /z are constants introduced when the equations are separated 
and the functions /, g are defined by 

m = ^ (M 2 +£)+/*, g(v) = 

In tho lowest possible electronic state A = 0, so that writing 


we have 


}_AX 
X d$’ 


1 dY 
Y dr)’ 


(64) 

dr) r) £ —\ 

The equations (54) have been integrated numerically by Burrau.f 
The range of the variables isl^^<oo ,—-1 < 77 ^ -\- 1 ; by assigning 
a value to the ratio R ab /cto the solution of the equations can be 
expanded in a power series for 0 < rj ^ | and in another series for 
l ^ V ^ 1 . The two series have the same sum at rj — \ only if w has 
a certain value. In this way w (and hence W) is determined for each 
value of the internuclear distance R ab ; the results obtained by Burrau 
are shown in the table below. The accuracy of Burrau’s calculations 



has been improved upon by HylleraasJ who assumed a solution of the 
form 

Y(v) = I c n PL Xl (v), 

»“|A| 

and also by Steensliolt.§ These treatments correct Burrau’s values by 
very small amounts; all three give R ab = 2*OOa 0 for the equilibrium 
value. This corresponds to a value R ab = 1*06 A in agreement with 
Richardson’s spectroscopic estimate|| of the distance between the pro¬ 
tons in the ion. 


A method such as that of Hylleraas is necessary to obtain accurate 
forms for the wave function <j> for the problem; it is found, however, 
that by taking <f> to be a linear combination of atomic orbitals it is 
possible to construct a wave function which is sufficiently accurate for 
molecular calculations. If 0 a , \jj b denote the wave functions of the 
electron treated in atoms A and B } supposed normalized to unity, then 


one may take 


<f> = A'l>a( r a)+ B M r b)- 


t O. Burrau, Kgl. Danske Vid. Selshab. 7, 14 (1927). 
t E. A. Hylleraas, Zeits.f. Physilc, 71, 739 (1931). 

§ G. Steensholt, ibid. 100, 547 (1936 )•, Norske Vid.-Akad. Avh. No. 4, 1936. 
|| O. W. Richardson, Trans. Faraday Soc. 25, 686 (1929). 
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The constants A and B are found by minimizing the energy 
W = J dr/ J </>*<!> dr; 

in this way we find A = ±B = (2 ; J I 2A) _ *, 
giving for the wave functions 

' t “;pT2Aj <# “ + ' w 

“ d 

The form of these two wave functions is shown in b'ig. 31. 


(55) 

(56) 

(57) 



Fig. 31. 


For the wave function ifj a we shall assume the form 



differing from the wave function of the ground state of the hydrogen 
atom by the inclusion of a ‘screening constant’, c, which will be a 
function of the nuclear separation R nb . Substituting from (58) into 
equation (56) and inserting in equation (55), we obtain for the energy W , 

W = -(fc*-Gc+ -?£-), (59) 

a 0 \ R(ibl 

where F(p), G(p) are known functions of the variable p = cB clb /a 0 ; the 
expressions for F, G are rather cumbersome but may be evaluated 
in the same way as the integrals in § 33.1. For a given nuclear separation 
\ R ab the best value of c is given by the equation 


(SL=i 


the suffix R ab denoting that is assumed constant during the 
differentiation. It follows immediately from (59) that c is a root of the 

“ .(»+*") - ( 60 ) 
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The values of c determined from this equation are shown in Fig. 32, 
and the corresponding curves for W as a function of are given in 




Fig. 33. Also shown on the same diagram are Pauling’s curves obtained 
by taking c = 1 in equation (58) and inserting this wave function in 
(55) with the combinations (56) and (57), and the curves obtained by 
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Hylleraas and Steensholt from the accurate forms of <f> in spheroidal 
coordinates. 

It appears that the simple forms (56) and (57) approximate quite 
well to the exact wave functions for the molecular hydrogen ion derived 
from the solution of the Schrodinger equation in confocal elliptic 
coordinates. 

$Let us now consider the hydrogen molecule for which the Hamiltonian 
is (40), (cf. Fig. 28). In the molecular orbital method the wave- 
function *// is of the form (51). Dropping the spin factor and substituting 
for ^(1) and <f>( 2) from equation (56) we have 

•A = K i{ l H r ai)+'P( r bi )}{*A( r «2)+ i>{r h2 )} 

= K l{H r al)U r a2) + H r bl)4‘{ r ai)+4 , { r a l)4>{ r b2)+H r bl)'l i ( r b2Yu ( 61 ) 
where K x denotes a normalization factor. 

Using the wave function (61) with i/*{r al ) defined by (58) we find that 
the energy can be put in the form 

W = -{2Fc*-2Gc+Jc + -^\ (62) 

a 0 \ ti (lb ) 

where F and G are the same functions of p as in the case of the molecular 
hydrogen atom, and J is another function of the same type arising from 
the presence of the term l/r 12 in the Hamiltonian (40). Applying the 
variation method as before we find that c is a root of the equation 

2c K'’f) - (>«+**■ 

Calculating the explicit forms of F, G, J we can draw a curve of the 
type Fig. 34 which gives c as a function of The values for the 

ion H^ are also shown on the same scale. It will be seen from this 
diagram that the repulsion between the two electrons in the neutral 
molecule tends to diminish the value of c . 

The corresponding curve for the energy (62) is shown in Fig. 35 
together with the curve obtained by Coulson using the Heitler-London 
method with the variation principle. The constants of the hydrogen 
molecule obtained by these methods of approximation are shown below. 
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It would seem that in the simple case of the hydrogen molecule the 
molecular-orbital and the electron-pair treatments are almost equally 
good. It is found, however, that in more complicated polar molecules 



Fig. 34. rasa function of B tlh for II 2 and H.}. 



Fig. 35. H 2 molecular energy curves. 

the molecular-orbital method is superior to the electron-pair (or Heitler- 
London) method. The method of molecular orbitals has been applied 
to a few molecules, f 

t See, for example, C. A. Coulson, Trans. Faraday Soc. 33, 388 (1937) (CH 4 ); Proc . 
Comb. Phil . Soc. 31, 244 (1935) (He«J); Proc. Boy. Soc. 169, 413 (1939) (polyenes and 
aromatic molecules) ; J. K. Knipp, J. Chem. Physics , 4, 300 (1936) (LiH + ); H. M. James, 
J. Chem. Physics , 3, 9 (1935) (Li 2 f ); C. A. Coulson and G. S. Rushbrooke, Proc . 
Camb. Phil . Soc. 36, 193 (1940) (unsaturated hydrocarbons); J. E. Lennard-Jones and 
C. A. Coulson, Trans. Faraday Soc. 35, 811 (1939) (polynuclear hydrocarbon molecules). 
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34. Distribution of momentum in the hydrogen molecular ion 

A great deal of theoretical work has recently boon done on momen¬ 
tum distributions in molecules.f In addition to its intrinsic interest 
the determination of the momentum distribution of the electrons in 
a molecule is of importance since it determines 

(i) the shape of the Compton line when X-rays are scattered from 
the molecule;J 

(ii) the distribution of tho fast inelastically scattered electrons from 
the molecule.§ 

Since these quantities can now be determined experimentally with a 
fair degree of accuracy it is important to attempt some kind of theo¬ 
retical treatment of the distribution of momentum in molecular systems. 
Although Coulson and Duncanson have discussed the momentum 
distribution in several molecules we shall confine our attention to that 
in the molecular hydrogen ion H^. 

According to transformation theory, the momentum wave function 
x(p) of the molecular hydrogen ion is obtained from its space wave 
function <f>( r) by means of the Fourier integral 


x(P) = 


h* 



(63) 


where for the symmetrical wave function we may assume <f> is given 
by equation (56) with the 0’s given by equation (58). If we use atomic 
units (Chap. VI above) and denote the position vectors of the nuclei 
by r (l , then (63) assumes the form 


p-i(P.r a )\ p/ -i{p.r b ) 

*<»- V(2 -; 2A, 

where A(p) = ^, f 
The density function is then 

x(P)x*(P) = — 


(64) 

(65) 


( 66 ) 


Now the function A(p) is simply the momentum distribution of the 
atomic orbital (58), so that the density (66) is the same as it would be 


f B. Hicks, Phya. Rev. 52, 436 (1937); C. A. Coulson, Proc. Camb. Phil. Soc. 37, 55, 
74 (1941); W. E. Duncanson, ibid. 37, 397 (1941); C. A. Coulson and W. E. Duncanson, 
ibid. 37, 67, 406 (1940), ibid. 38, 100 (1942). 

t J. W. M. Du Mond and H. A. Kirkpatrick, Phya. Rev. 52, 419 (1937). 

§ A. L. Hughes and M. M. Mann, ibid. 53, 50 (1938). 
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if the electron were confined to one of the nuclei, except for the factor 

1 icos(p.R^) 

1-fA 

which gives rise to a kind of diffraction effect and is consequently called 
the diffraction factor. 

The density function x(p)x*(P) given by equation (66) has symmetr 3 T 
around the internuclear axis, so it is desirable to draw the contours 
of equal momentum density in a plane through this axis. The result 
is shown in Fig. 36, from which it will be observed that the contours 
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\ 


Fig. 36. Contours of constant momentum density for H«f. 
(From Coulson, Proc. Camb. Phil. Soc. 37, 62 (1941).) 


are approximately ellipses with major axes perpendicular to the 
direction of the internuclear axis; there is thus greater probability in 
directions perpendicular to the bond than along it. 

The probability I x {p z ) dp x that the component of the momentum 
along the internuclear axis lias a value between p x and p x -\dp x in 
determined by the equation 


OO CO 

UPx) = J J x(P)X*(P) d Py d Pz = 

— 00 —oo 


8C 5 1 + COS RabPz 

Mi+W TpW 1 ? * 


(67) 


In the case of a hydrogen atom R ab — 0, c = 1, A — 1, so that 


/ x(Px ) 


8 

MpI+i?' 


( 68 ) 


The variation of the functions (67) and (68) with p x is shown graphically 
in Fig. 37, from which it is evident that in the molecular ion there are 
fewer large values of p x than in the atom; if we calculated the probability 
functions I y (p y ), I s (p B ) 9 perpendicular to the internuclear axis we should 
obtain precisely the opposite effect. 
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For the application of these principles to more complex molecules 
the reader is referred to the papers by Coulson and Duncanson cited 
above. 



Ftg. 37. Distribution of component of momentum 
along the intemuclear axis. 


35. Qualitative results for more complex moleculesf 

35.1. The principle of maximum overlapping 

The mathematical difficulties encountered in the calculation of the 
energies of molecular orbits, other than those of H 2 and H^, are so 
formidable that problems of molecular structure are usually discussed 
with the aid of a new assumption known as the principle of maximum 
overlapping . It is assumed as a first approximation that the exchange 
energy is due to the overlapping of the electron clouds surrounding the 
nuclei and that the energy of a molecular orbit is lowest (i.e. the binding 
energy is greatest) when the atomic orbits which it resembles overlap 
as much as possible. 

In applying this principle we require a knowledge of the probability 
distribution functions for atomic orbitals. We saw in the case of the 
hydrogen atom that a Is wave function is of the form 

W«)=Mr), 

so that the angular dependence of the wave function for an atomic 
orbital is of the type shown in Fig. 38 a. The angular dependence of 
the wave function 

Wvx) = -Mr) 

r 

for a 2p a .-atomic orbital is shown in Fig. 38 6. The squares of these 
functions give the angular dependence of the probability distribution 

f See L. Pauling. The Nature of the Chemical Bond , chap, iii (Cornell Univ.); C. A. 
Coulson, Proc . Roy . Soc . Edin . A, 61, 115-39 (1941-2). 



§ 35 QUALITATIVE RESULTS FOR MORE COMPLEX MOLECULES 193 


functions. The 2p y , 2p z orbitals are of the same form as the 2p x one 
except that the axis of symmetry is Oy , Oz respectively in place of Ox. 
In the 5-orbit the electron is practically confined within a sphere 
whereas an electron in a 2/;-orbit is most likely to be found around 
or within a region with the dumb-bell shape of Fig. 38. 



Fig. 38. Polar representation of absolute 
values of the angular wave function for 
ls-orbital and 2p-orbitals. 


The use of the principle of maximum overlapping is most simply 
illustrated with reference to the water molecule H 2 0. Each hydrogen 
atom is in a 15-state. The oxygen atom has the electronic structure 

(1 8f{2s)\2p)\ 

the two inner K -electrons keep close to the nucleus and do not affect 
the binding, while those filling the 25 atomic orbital form a closed sub¬ 
group round the nucleus. Of the four remaining electrons two fill the 
2p c -orbital and do not take part in the formation of molecules (since 
they are paired together); finally, there is one electron in the 2p x orbit 
and one in the 2p y orbit. They alone take part in molecule building* 
We assume further that the 2s and 2p wave functions do not mix. 
If this is so, then by the principle of maximum overlapping the greatest 
binding energy is obtained by placing one hydrogen atom along the 
#-axis and the other along the y-axis, as shown on the left in Fig. 39, 
which is taken from Coulson’s paper, f We see immediately that the 
angle between the bonds, HOH, should be about 90°; this derivation, 
however, ignores the mixing of the 2p and the 25 wave functions and 
the repulsion between the two hydrogen atoms. These corrections have 

f C. A. Coulson, loe. cit., Fig. 6. 

O 
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the effect of increasing the bond angle. The observed bond angles for 
water-type molecules (oxygen family) are:f 

H 2 0 


f 2 o 

C1 2 0 

h 2 s 



105° 

100 ° 

115° 

92° 



Fig. 39. Tho water molecule. 


35.2. Carbon molecules: hybridization of wave f unctions 
In the carbon atom two of the six electrons are in iT-shell orbits and 
take no part in chemical binding. The other four electrons are i-shell 
electrons whose orbits are chosen from 2 s, 2p x , 2 p y , 2p z . The nature 
of the bonds that carbon forms in any particular molecule determines 
the way in which these four orbits are formed. In carbon, since the 
energy of the 2s is nearly equal to that of the 2^-orbits, we may form 
compound, or hybrid, orbitals by taking linear combinations of the 2s, 
2p wave functions. Denote the latter wave functions by 0( 2s ), ijj(2p x ), 
ifj(2p y ), tp(2p z ) and assume they are orthogonal and normalized. From 
them form four new wave functions i/t r (r = 1,2,3,4) defined by the 

equations ^ = a r $(2s)+b r il>(2 Px )+C f 4,(2 Py )+d r <P(2 Pz ) (69) 


(r — 1,2,3,4), where the a’s, b’ s, c’s, and d’& are constant. The energy 
of the orbits corresponding to these wave functions is that of the nearly 
equal 2s and 2 p states. If we denote the direction cosines of a certain 
direction by = (l t , m if nf), then the wave function 


'K 2 Pi) = k 'l>(2Px)+ m i ^p y )+n { jj(2p z ) (70) 

is of the type ${2p) and has its maxima along the directions ( l i9 m i9 n { ) 
and (— l if ~m if With this notation we may write equations (69) 

in the form = a^(28)+b^(2 Pi ) (71) 


t A. F. Wells, Structural Inorganic Chemistry (Oxford, 1944). 
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(i = 1,2,3,4), and note that the maxima of this function are also along 
the direction r*. For the wave functions (71) we have 


and 


f <pi4>jdT =r- a i a j -\-b i b i J ip(2p i )^(2p j )dr 
j 'l>( 2 Pi) t l , ( 2 Pj) dr = Z, l, = cos 6 ip 


so that the orthogonality and normalization relations yield the ten 
equations 


®<®i+MiCos00 = 8 {j 


(72) 


(i,j = 1,2,3,4) for the determination of the eight constants b t . 
In this equation 6 ti denotes the angle between the directions r ; and r 4 ; 
cos 6 U is taken to be unity. 


Case (i). Tetrahedral bonds . If we take the directions r v r 2 , r 3 , r 4 
to be normal to the faces of a regular tetrahedron then, for i 

cos d if = — i, 

and equations (72) reduce to 

a 2 +6 2 = 1, a 8 —$6* -= 0, 

where a = = a 2 = a 3 = a 4 , & = = b 2 = b 3 = ft 4 . We thus have 

a = 1, b = |V3, 
so that the wave functions are 

= 4^(2ff)+iV3^(2p 4 ). (73) 

If we take as axes the lines joining the mid-points of the edges of the 
regular tetrahedron the vertices have coordinates of the type (1,1,1), 
(1,-1, —1), (—1,1, — 1), (—1,—1,1), and the wave functions (73) 
become 

<Ai = ¥{4 , ( 2s )+ , P( 2 Px)+ i l j ( 2 Pv)+ , f ) ( 2 i > z)} \ 

= i{'l'( 2s )+'P( 2 Px)—'l'( 2 P v )-H 2 Pz)} I , 74) 

h = m 2 *)-'tt 2 Px)+'H 2 P v )->l'( 2 Pz)} * 

= h{'l ) { 2s )-'l‘( 2 Px)—'l , ( 2 Pv)+'l'( 2 Pz)} j 

Functions of this type are called tetrahedral orbitals ; in each of them the 
electron is concentrated in a particular direction about which the wave 
pattern is symmetrical. The wave functions of this type have a form 
resembling that of Fig. 40. 

If, when it is ready to form a molecule, the carbon atom has its four 
unpaired electrons each in one of these tetrahedral orbitals, the principle 
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of maximum overlapping shows that it can form four equivalent bonds 
directed to the four corners of a regular tetrahedron whose centroid 
coincides with the carbon nucleus. The four bonds thus make angles 
of 109° 28' with each other. In this way we can account for the charac¬ 
teristic tetravalency of carbon—cf. the methane molecule CH 4 (Fig. 41). 
When the four radicals which are bonded to the carbon atom are not 



Fin. 40. Single tetrahedral 
orbital. 



Fk;. 41. The methane 
molecule. 


all identical, it can bo shown that their mutual repulsions and different 
electron affinities do not affect the tetrahedral angles by more than two 
or three degrees. The tetrahedral bond can also be shown to be very 
stable. 

Case (ii). Trigonal bonds. If we take the directions r v r 2 , r 3 to lie 
in a plane making angles 27 t /8 with each other and take r 4 to be normal 
to this plane, then 

COS0 12 = COS 0 23 = COS 0 31 —1, 

cos 6 U = cos 0 24 = cos 0 34 ~ 0, 
and so from equations (72) we have 

«i = a 2 — a 3 = 1/V3, 
b x — 62 — V|, 

a 4 = 6 4 = 0 . 

Choosing r 4 to lie along the z-axis and r x to lie along the ic-axis we then 
find «Ai = {«/-(2s)+V20(2 JJ J}/V3 

•An = {V 2 4>(2s) - t(2p x )+ V3 <A( 2 Pi/ )}/V 6 
•Am = {V 2 0 ( 2 «)—^( 2 jp x )—V3 ^A( 2 ^)}/V 6 
•Aiv = 'I'Wp.). 

These orbitals are called trigonal orbitals . They resemble the tetragonal 
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orbitals in the sense that the first three (which are shown in Fig. 42) 
possess a marked directional property. The 
principal directions lie in one plane and are 
separated by angles of 120°; the fourth orbital 
is an undisturbed ‘dumb-bell’ with its axis 
normal to the plane containing the other 
three. There is one electron in each of the 
trigonal orbits when a carbon atom is about 
to form an unsaturated molecule, such as 
ethylene (C»H 4 ), in which the bond angles 4 ~* Tho llm, ° <*o-plamir 

trigonal orbitals. 

are 120 . 



35.3. The\ ethylene double-streamers 

As an illustration of the occurrence of trigonal wave functions we 
shall now consider the structure of an unsaturated molecule, ethylene: 

H H 

x 

H H 

in which there is a double bond. The two carbon atoms are in the 
trigonal state so that we can form five ‘sausage’ bonds each of two 
electrons, such that there are three bonds from each carbon atom. 
Normal to the plane of these orbitals there is a dumb-bell orbital 
corresponding to a mobile electron which is unpaired and not engaged 
in the formation of a bond as yet. The criterion of maximum over¬ 
lapping demands that the bonds in the C-H 2 plane are at angles of 120° 
with each other; the best we can do with the mobile electrons from 
the two carbon atoms is to pair these together somehow. If the two 
CH 2 planes of the molecule coincide the two dumb-bells are in parallel 
directions so that they overlap most in this direction. These 2 p z 
electrons will then form themselves into molecular orbitals of the type 
shown in Fig. 44, in which the probability function has the form of 
two streamers one on each side of the 0 2 H 4 plane. For such a molecular 
orbital the wave function will be approximately 

<A = >l'a( 2 Pz)+'l , b(2pz)- 

Both electrons have, of course, the same ‘double-streamer’ wave func¬ 
tion; it should be noted that it is not a case of the upper streamer 
representing one electron and the lower streamer the other. The two 
streamers are inseparable, constituting the additional factor which 
converts a single bond into a double bond. 
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A characteristic feature of the double bond is its high reactivity. 
Since the streamer electrons are not so tightly bound as the other 
electrons—the atomic orbitals from which they are built do not over¬ 
lap as much—it is easier to form new molecules by disengaging them 
from one another and linking them with other atoms. 



Fig. 43. Overlapping of atomic 
orbitals in ethylene. 


(+) C3> 

—> *A B • 

0 < 3 ? 

Fig. 44. Mobile oloctrons in ethylene. 

Similarly, the triple bond is formed by the addition of a further pair 
of streamer electrons in a plane normal to that of the other pair of 
streamers. 

35.4. The benzene molecule (C e H 6 ) 

A molecule of some importance in organic chemistry is the benzene 
molecule and it is one of the classic problems of chemistry to determine 
the distribution of electrons in this molecule. Now a number of sources 
indicate that all the carbon and hydrogen atoms lie in one plane. We 
see then that for this to be possible each of the carbon atoms must be 
in a trigonal state. The atomic orbitals (Fig. 45) then form twelve 
‘sausage’ bonds and the molecule forms a plane ring containing six 
carbon atoms. There are then six mobile electrons left unpaired; their 
orbitals are dumb-bells whose axes pass through the carbon nuclei and 
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are normal to the hexagon formed by the carbon nuclei (Fig. 46). 
These electrons will, however, interact and will be free to move about 
from one carbon nucleus to another, forming molecular orbitals of the 
type shown in Fig. 46. The streamer bonds which we encountered in 
ethylene are now spread out over a wider space—we have two streamers 
going right round the molecule one on either side of the carbon hexagon. 



Fig. 45. Overlapping of atomic 
orbitals in benzono. 


Fig. 46. Mobile electrons 
in benzene. 


If we write i / j v «// 6 for the dumb-bell orbitals the molecular 

orbitals streaming round the molecule will, by an analogy with the case 
of H 2 , be of the form 6 

^ = 2>r&- 

r=i 

The constants c r have to be chosen (by a minimum energy method) 
so that this function is as good a solution of the wave equation as 
possible. It can be shown that there are six possible molecular orbitals 
of the required character each with different values for the c r and each 
having a different value for the energy. If E 0 is the energy of an isolated 
2p z atomic orbit and J8 is a certain constant, of negative value, whose 
magnitude is known from comparison measurements on ethane and 
ethylene, these molecular energies are 

Eo-\~W> EM Eo~~P> Eq—P> Eq—%P- 

The energies of the first three orbitals are thus less than that of an 
isolated 2 p z atomic orbit; these levels are called bonding orbits . There 
will be two electrons in each of these bonding orbits in the normal state 
of benzene so that their energy is 6U 0 -f 8j3. If we had supposed that 
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the molecule hod a configuration Jj in which single and double 

bonds alternate, the energy would have been 6i? 0 -f 6/3. The extra energy 
2j8 gained by removing the restriction to localization has been called the 
resonance energy. The value 2)8 is about 30 k.cals. which agrees quite 
well with the experimental value (37 k.cals.). 

Similar calculations to those we have outlined briefly for benzene 
have been carried out for other molecules of this type and have been 
found to explain many of the well-known facts of organic chemistry. 
For example, it has been shown that the characteristic colour of many 
of the organic dyes may be attributed to the excitation of a mobile 
electron from a bonding orbit to an unoccupied orbit with energy 
exceeding E 0 . The results of calculations of the resonance energies of 
a few unsaturated molecules are embodied in the table below.| It will 
be observed that the agreement with the experimental values for the 
resonance energies is reasonably good when we consider the nature of 
the approximations involved. 


Table 6 


Molecule 

| Resonance energy {k.cals.) 

Calculated 

Experimental 

Butadiene 

5 

5 

Hexatriene 

11 


Octatetraene 

17 


Benzene 

30 

37 

Diphenyl 

65 

87 

Naphthalene 

63 

75 


t C. A. Coulson, op. cit., p. 130. 
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36. Cohesive forces in ionic crystals 

The theory of the cohesive forces in ionic crystals developed by Max 
Born I and his collaborators was one of the first successful applications 
of atomic physics to the theory of solids. Though not strictly an applica¬ 
tion of quantum mechanics, it forms the basis of subsequent develop¬ 
ments, and it will be necessary to give an account of it here. 

The aim of Born’s work is to account for the lattice energy, lattice 
constant, crystal structure, and elastic properties 
of ionic crystals in terms of an atomic model. 

The theory has been applied most successfully to 
alkali halides, and will be explained here with 
reference to them. Alkali halides have the simple 
cubic structure; this is shown in Fig. 47. 

The theory worked out by Born rests on the 
following assumptions: 

(a) The crystal is built up of positively and 

negatively charged ions, carrying charges re- Fig. 47. Sodium chloride 
spectively. Ions of unlike sign therefore attract structure. 

each other and ions of like sign repel, the force in each case being equal 
to e 2 /r 2 . 

( b ) Any two ions repel one another with an additional force that is 
practically equal to zero until their separations fall below a certain 
value, when the repulsion increases rapidly. We shall denote the 
potential energy due to this repulsion by w(r) and call it the overlap 
energy, because it only comes into play when the electronic clouds of 
the ions overlap. The total potential energy of a pair of ions of unlike 
sign is thus 

w(r)—e 2 /r, 


• No 
0/7/ 


where r is the distance between them. 

The form of the function w{r) could in principle be calculated for 
any given pair of ions by the methods of quantum mechanics, but this 
has not actually been done (cf. Chap. VII); in the theory as developed 


t For a survey see the following: N. F. Mott and R. W. Gurney, Electronic Processes 
in Ionic Crystals (Oxford, 1940); F. Seitz, Modem Theory of Solids (New York, 1942); 
M. Bom, article in the Handbuch der Physik t 24/2 (Berlin 1933). 
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at present it has been usual to take an empirical form for the inter¬ 
action energy w(r), either 

w(r) — A r~ 8 with s ~ 9 (1) 

or w(r) — Ae~ r lp , (2) 


a form which is suggested by quantum mechanical calculations of the 
force between rare gas atoms.| 

The ions adhere to form a stable crystal because of the electrostatic 
attraction between unlike ions, which outweighs the repulsion between 
like ions. They keep apart because, when two ions overlap, the strong 
repulsive force with energy w(r) comes into play. On this basis we have 
to calculate the energy required to separate the crystal into its 
constituent ions. This may be done as follows: 

Let <f> denote the electrostatic potential at any lattice point of the 
crystal due to all the ions except the one occupying that point. This 
quantity has been worked out for a number of structures; for the rock- 
salt structure at a point normally occupied by a positive ion it is 

<f> = — 

where r is the shortest distance between ions of unlike sign and ol m is 
a numerical constant known as the Madelung constant. For the rock- 
salt (simple cubic) structure it has the value 

a M = 1*7476.... 

The electrostatic energy per ion pair is thus 

—oL M e 2 / r . 


In calculating the energy due to the overlap forces, the simplest 
assumption is that the forces fall off so rapidly with distance that only 
ions which are nearest neighbours need be taken into account. Since 
each ion has six nearest neighbours, the overlap energy per ion pair 

i8then 6 w(r), 

and the total energy per ion pair is thus 


U(r) — — 6w(r). 

r 


( 3 ) 


To determine the two unknown constants in the function w(r) we 
make use of experimental values of the lattice parameter and the com¬ 
pressibility (x)> The condition that the crystal is in equilibrium under 
zero pressure, namely, dU{r) j dr = 0 , (4) 

t Cf. § 33.3. 
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gives one relation between these quantities, and for the compressibility 
X a short calculation gives 


1 _ 1 d*U(r) 
X “ 18r dr 2 


(5) 


If we use the expression (1) for w(r), equations (4) and (5) give 


_ J_ 
6As r 5-1 


and 


I8r 4 

i)* 


The second of these equations gives s and the first A, all in terms of 
measurable quantities. Estimates of the repulsive energy w(r) can thus 
be made. 

The theory has been refined by including in the energy the van der 
Waals attraction between the ions, the repulsion between ions which 
are not nearest neighbours, the zero-point vibrational energy, and so 
on. By comparison of theoretical formulae with observed lattice 
constants and compressibilities, values of w(r) have been deduced for 
the interaction energy of halide ions with alkali, silver, thallium, and 
cuprous ions.f 

For the alkali-halides a particularly convenient form of interaction 
energy is given by Mayer;J this is (in ergs) 


iv(r) =-■ 4 X 10 _12 e“ (r " r *~ r ^, 


where p = 0-345 x 10~ 8 cm. and r v r 2 are ionic radii shown in the table. 


Table 7. Basic radii r x and r 2 x 10 8 cm. 


Li+ . 


. 0-475 

F“ . 

. 1110 

Na+ 


. 0-875 

Cl" . 

. 1-475 

K+ . 


. 1-185 

Br“. 

. 1-600 

Rb h . 
Cs + . 


. 1-320 
. 1-455 

I" . 

. 1-785 


The energy per ion pair required to separate the crystal into its 
constituent ions is, on the simple treatment given above, easily seen 
to be given by 2/ 


(6) 


This has a value of the order 5-6 eV. per ion pair, or 120 k.cals. As 
s ~ 9, the greater part of (6) is due to the electrostatic forces. \U\ is 


f For references cf. Mott and Gurney, loc. cit., chap. i. 
t J. E. Mayer, J. Cheni. Phys. 1, 270 (1933). 
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connected with the energy U A per ion pair necessary to dissociate the 
crystal into neutral atoms by the formula 

\u I - u a -e+i 9 

where I is the ionization potential of the metal atom and E the electron 
affinity of halogen ion. From calculated values of IJ (deduced from 
formula (6)), using known values of r, y and experimental values of 
U A , ./, one can thus deduce theoretical values of the electron affinities 
E , which are not in general known in any other way. The success of 
the theory lies in the agreement between the values of E deduced from 
different compounds; this is shown below\ 


Table 8 . Electron affinities of the halogens in k.cals. 


Metal with which 





combined. 

V 

01 

Br 

I 

Li ... 

950 

85-7 

81 2 

75-8 

Na .... 

96-5 

86-5 

80*9 

73-9 

K 

95-5 

87-I 

81-2 

73 2 

Ub .... 

95-8 

85*7 

82-0 

73-8 

Cs .... 

93-8 

87*3 

820 

74-2 

(Sutton and Mayer f) 




72*4 


It seems that the theory built up along these linos gives a satisfactory 
account of cohesion in alkali and silver halides. For oxides and sul¬ 
phides and in general for crystals where the ions carry two or more 
charges, no recent comparison between theory and experiment has been 
made.J The difficulties in doing so are duo to the following facts: 

(a) The van der Waals forces between oxide ions are large and 
difficult to estimate. 

(h) It is possible that some ‘homopolar* forces exist between the ions 
as well as the electrostatic attraction ; in other words, the over- 
y lap force is not wholly repulsive in character. 

/ (c) The oxide ion 0— is not stable in free space. 

/ 

37. Crystals in thermal equilibrium 
37.1. Lattice vibrations 

In a crystal in thermal equilibrium at a temperature T , we must 
consider three types of deviation from the perfect lattice: 

(а) The lattice vibrations of the atoms. 

(б) The existence of vacant lattice points. 

(c) The existence of interstitial atoms or ions. 

t P. P. Sutton and J. E. Mayer, J. Chem. Phye. 3, 20 (1935). 

X See an early attempt by J. Sherman, Chem. Rev. 11, 93 (1932). 
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The theory of the lattice vibrations will not be considered in detail 
here.f We may remind the reader of the rather crude (Einstein) model, 
in which each atom of a monatomic solid is supposed to vibrate inde¬ 
pendently of all the other atoms with frequency v ; the mean energy 
of each atom is thus , h * 

»{** + ?*£_! }• W 

The temperature hvjk is known as the characteristic temperature and 
denoted by 0. For temperatures large in comparison with 0, (7) reduces 
to 2>kT , giving a specific heat of 3 Nk, where N is the number of atoms 
per gramme. If x 2 is the mean amplitude of the thermal vibrations 

"a _ h*TlMk&\ (8) 

where M is the mass of an atom. 

In an exact theory of the lattice vibrations and of the contribution 
of the specific heat due to them,J it is necessary to analyse the vibra¬ 
tions of the lattice into its normal modes, and denote by n(v)dv the 
number of normal modes with frequency between v and v-\- dv\ the total 
number of modes for a solid of N atoms is 3 N, so that 

J n(v) dv = 3 N. 

The total integral energy is thus 

l n(v)^hv+ 

The calculation of n(v) for any real solid is a very difficult problem. 
Debye showed that for small v 

n(v) — const, r 3 

and showed how to calculate the constant in terms of the elastic 
constants of the solid.§ 

37.2. Vacant lattice points and interstitial ions 
In addition to the thermal vibrations, a crystal in thermodynamical 
equilibrium differs from a perfect crystal through the presence of 

(a) Interstitial atoms or ions. 

(b) Vacant lattice points. 

The case of a monatomic solid (metal) will be treated first. 

It will be possible for atoms from the surface layer to diffuse into 
the body of the crystal, where they will be found in the so-called 

f See, for example, R. H. Fowler, Statistical Mechanics, 2nd ed. (Cambridge, 1937) or 
F. Seitz, loc. cit. 

I For most materials this is much the most important contribution. 

§ For recent calculations of n(v) ef. M. Blackman, Proc. Roy. Soc. A, 148, 384 (1935), 
159, 46 (1937); Proc. Camb. Phil. Soc. 33, 94 (1937). Also see Seitz, loc. cit., p. J03. 
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‘interstitial positions’, as illustrated in Fig, 48 a. We may denote by 
Wi the energy necessary for the formation of an interstitial atom. 
W i will tend to be large for close-packed structures, for which the space 
available in an interstitial position is small and the distortion of the 
surroundings necessary to make room for an interstitial atom large. 



(a) (b) 

Fig. 48. Showing formation of interstitial ions (a) and vacant lattice points (6). 

In any case an elementary calculation*)* gives for the number of 
interstitial ions per unit volume 

»< - N i e“ J W kT 9 

where N { is the number of interstitial positions per unit volume. 

In the same way, vacant lattice points (holes) can be formed at the 
surface by diffusion inwards, as shown in Fig. 486. Let W h be the 
energy necessary to form a hole; then in equilibrium the concentration 
of holes is % = Ne-W* T , 

where N is the number of lattice points per unit volume. 

For close-packed metals it is probable that W h is less than W t , so that 
the holes will greatly outnumber the interstitial ions; for copper, theo¬ 
retical calculations;]: indicate that this is in fact the case. According 
to these workers, self-diffusion in metals takes place through the motion 
of holes. It is suggested that an atom will not move from one lattice 
point to another unless there is a hole next to it.§ 

In ionic crystals, in principle it is possible to have vacant lattice 
points and interstitial ions of both signs; in general, however, the energy 
required to put the large negative ion into an interstitial position is 


t The number of ways that the n i ions can be distributed is 

P « tf i !/n i !(iVr i --n i )!, 

and the free energy is therefore 

UiWi-kT log P. 

Making this a minimum with respect to charges in n it and remembering that <*§ N if 
the given equation is obtained. 

X H. Huntingdon and F. Seitz, Phys. Rev . 61, 315 (1942). 

§ For a discussion of the mechanism, of. R. P. Johnson, Phye. Rev. 56, 814 (1939); 
G. Wyllie, Proc . Phys. Soc. 59, 694 (1947). 
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so large that the concentration is negligible. Also any large volume 
of the crystal must be electrically neutral; otherwise large fields will 
be set up by the space charge. 

It is thus convenient to consider two types of defect, both of which 
are electrically neutral. 

(a) An interstitial ion (usually positive) and some distance from it 
a vacant lattice point from which the ion may be supposed to have 
come. This is called a Frenkel defect.| It will be noted that a Frenkel 
defect can be formed in the body of the crystal. If W f is the energy 
necessary, the concentration of Frenkel defects is given byj 

n = J(N { N)e~Wlw. 

(b) Two vacant lattice points, one of each sign, wandering about 

independently of each other. This is called a Schottky defect. If W 8 is 
the energy necessary to form a Schottky defect, the concentration 
will be n __ u e -\wjkT' 

The formulae are valid only if W s , W f differ considerably, so that only 
one type of defect is actually formed in appreciable quantities. 

It is believed that Schottky defects are formed in alkali halides some¬ 
what above room temperature,§ while Frenkel defects are formed at 
a concentration of about 10" 10 per atom in silver halides at room 
temperature. 

The ionic conductivity that all solid ionic crystals show at high 
enough temperatures is probably due to defects of one or other of these 
types, and either to the motion of interstitial ions, or to the motion 
of holes, or both. 

A direct proof of the importance of holes (vacant lattice points) is 
given by the well-known experiments of Koch and Wagner;|| crystals 
of silver halide were prepared containing small quantities of CdCl 2 or 
CdBr a in solid solution. In these crystals each Cd+ + ion replaces two 
Ag+ ions in the lattice; the crystal must thus contain at least as many 
holes as there are Cd ++ ions. The number of holes is thus greatly 
increased and the conductivity, especially at comparatively low tem¬ 
peratures, is greatly increased also. 

38. Electronic motion in crystals 

For the understanding of the motion of electrons in both metals and 
non-metals it is essential to study the motion of an electron in a field 

f J. Frenkel, Zeits.f. Physik, 35, 652 (1926). 

j Mott and Gurney, loc. cit., chap. ii. 

§ F. Seitz, Rev . Mod. Phys. 18, 384 (1946). 

|| E. Koch and C. Wagner, Zeits.f. phys. Chem. B, 38, 295 (1937). 
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which is periodic in space with the period of a crystal lattice. The 
necessity for this may be seen most easily by considering a crystal such 
as sodium chloride into which an extra electron has been introduced. 
(In practice this can be done by photo-electric excitation, cf. Mott and 
Gurney, loc. cit., Chap. IV.) One can imagine the electron located'on 
one of the metal ions and thus forming a metal atom. But if the 
electron transfers its position to the next atom, there is no change of 
energy; and the potential barrier between the two positions can easily 
be penetrated by tunnel effect. The electron is thus free to move 
through the crystal. The wave functions describing its motion will be 
wave packets or plane waves, like those describing the motion of an 
electron in free space, and can extend right through the crystal. 

Similar wave functions are used to describe the free electrons in 
metals. Their use depends on the assumption that, so far as its effect 
on a given electron is concerned, the field of all the other electrons can 
be averaged in the sense of the Hartree approximation (Chaps. V, VI). 
This will be discussed further in the section devoted to metals. 

In either case, therefore, it is necessary to study the motion of an 
electron in a field which is periodic with the period of the crystal lattice. 

38.1. Motion in a periodic field 

In order to understand the properties of an electron in a periodic 
field we shall consider first the one-dimensional case; we consider, then, 
the solutions of the Schrodinger equation 

j?+%"w-vm = «■ <»> 

where V is a periodic function of x with period a. 

For a solution iff to represent a possible state of the electron in an 
infinite metal, iff must be bounded for all x. We shall show that the values 
of the energy W for which such solutions are possible fall into a series 
of bands , as shown in Fig. 49. All such solutions can be written in the 

^ oim e ikx u( x) (— it fa < k <7r/a), 

where u(x) is periodic with period a. 

For each allowed band of energy values, k varies continuously from 
— 77 fa to 7 r/a, so that a series of allowed energy bands can be defined, 
W^k), W % {k), and so on. 

Various proofs have been given, and we shall refer to one due to 
Kramers.f Kramers first points out that, if ipi(x) and j/r 2 (#) are inde- 

t H. A. Kramers, Phyeica , 2, 483 (1935); see also F. Seitz, loc. cit., p. 278. 
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pendent solutions of the Schrodinger equation, then since if/^x+a) is 
also a solution, it follows that 

ipi(x+a) = 

It is then easy to show that two linear combinations of i/j 1 , i/j 2 may be 
chosen, <f> v <f> 2 , with the property 
<f>i{x+a) - A x 

“ A 2 0 %(%)• 

From these equations we have, differentiat¬ 
ing each equation and multiplying by the 
other, 

(0102 0102 )x+a ~ ^1^2(0102 01 ( f >, 2 l )x* 

( 10 ) 

For any Schrodinger equation, however, if 
tf> lt 0 2 are any two independent solutions 
corresponding to the same energy, 

0102 01^2 ~ 

so that 01 02 “0i 02 — const. 

It follows from (10) that A X A 2 = 1. 

Now if ifj is bounded, it is clear that both A x and A 2 must be real or 
imaginary quantities with modulus unity. For if this were not the case, 
then \i/j\ would increase either with increasing or decreasing x , and 
would not be a bounded function. Thus we may write 

Aj — e ik , A 2 — er ik . 

It follows that the two solutions <f> v <^ 2 have the forms 

e ±ikx u k (x), (11) 

where u k (x) is periodic in x with period a. 

It follows that the lowest state of each band of energy values is 
periodic, and that the higher states consist of plane waves modulated 
by the periodic field, f 

Wigner and Seitz J have given the following method for determining 
the wave function in the lowest state in one of the energy bands in 
close-packed metals, which is of particular importance. They point 
out that if planes are drawn bisecting the lines which join nearest 
neighbours in the crystal, a polyhedron is formed surrounding each 

f For an. evaluation of the wave functions in a particularly simple type of periodic 
field, cf. R. Kronig and W. G. Penney, Proc. Roy. Soc. A, 130, 499 (1931). 

X E. Wigner and F. Seitz, Phys. Rev. 44, 804 (1933); cf. also Mott and Jones, loc. cit., 
chap. iv. 

6072 p 



Fia. 49. Allowed values 
of the onergy IF as a func¬ 
tion of wave-number k. 
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atom. Supposing we now look for a periodic wave-function u(x,y,z) 
which has the symmetry of an s wave function (spherical symmetry) 
in the neighbourhood of each atom. Then clearly du/dn must vanish 
on the boundary of the polyhedron. In order to calculate u, the poly- 



Fig. 50. Wave functions for metallic sodium, copper, and silver. 


hedron is replaced by a sphere of equal volume, radius r 0 . Then the 
Schrodinger equation 

1 d I o du\ . 2m rxr T/ . A 

?Tr( r dr) + l‘ {W - r)u = 0 
has to be integrated subject to the boundary condition 
du A r 

— = 0 for r = r n . 
dr 

It is clear that a whole series of energy values will exist, for which 
u vanishes on 0 , 1 , 2 ,... spheres round each nucleus. For sodium with 
configuration (l«) 2 (2s) 2 (2p) 6 (35) 1 , the states with respectively zero and 
one node will correspond to the X-ray K and L levels, and the wave 
functions will not differ much from those for the free atoms. The case 
corresponding to the valence electrons of various metals is shown in 
Fig. 50. For sodium states with more than two nodes correspond to 
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high positive energies W. Each of these wave functions and energy 
values corresponds to the lowest state of a band of energies. These 
bands will be referred to as 5-bands or «s-zones. 

The variation of energy with k within an 5-band can be investigated 
by means of an approximate method known in the literature as ‘the 
method of tight binding’. It will be seen that the wave-function tp k 
can be written in the form 

<£k = £ e’'< kr 'V> fc (r—r,), (12) 

where <f> k {r) is some function with the requisite number of zeros tending 
to zero at a few atomic radii from r = 0, and the summation is over all 
lattice points r t . The wave function of Wigner-Seitz type illustrated 
in Fig. 50 could, for instance, be so represented. Let us assume this 
done; and make the following approximations: 

(а) (f> is independent of k; 

(б) overlap is negligible between the wave-functions cf>(r— r z ), 
<f>(r—r r ) unless the lattice-points r l} r* are nearest neighbours. 

Then, denoting by H the Hamiltonian operator, 

—-^-V 2 +F, 

2 m 


the energy W k of the state k will be given by 

W k = J ft H>/j k dr/j ft ft dr. 
With the approximations given above, this reduces to 

m _ 

* ~ A'-B'Z^' 


(13) 


Here p / is the vector joining an atom to its nearest neighbours, and 
the summation is over all the neighbours of any one atom; also 

A = J dr, 

A' = J dr, 

B = j <f>*(r)H<f>(r—p) dr, 

B' = J>(rtf(r—p)<*r. 

For a simple cubic lattice of lattice constant a for example, 

2 = 2{cos£ a .a+cos& 1/ a-f cos& s a}. 
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Prom this formula, we see that all values of k within the cube 

— rr/a < k x <' tt ja 
—77 ja <; ky < 77 ja 

— n/a k z < nja 

correspond to independent wave functions with energies in the 5-band. 
The correspondirig cube in -space (Fig. 51) is called a 4 Brillouin zone’. 


k r 


(b) 

Ftg. r>l. Zone in A-space for states in (a) a simple cubic lattice, and (b) in a 
hexagonal close-packed lattice (two dimensions). 

It is clear that for a crystal of N atoms the number of electronic 
states in the zone is 2N, since two spin directions are allowed for each 
wave-function i/; k . This is the case for* any crystal structure. For the 
simple cubic it is instructive to verify this result by quantizing k by 
laying down the condition that ijj must be periodic in x< ?/, ^ with period 
na where n is a large integer. This is a conventional quantizing condi¬ 
tion; it would be appropriate for a wire bent round into a circle, for 
a coordinate measured along the circumference of the wire. We deduce 
from this condition that 

k x — "Inn'/na ( n' = — ,(),..., +-|n), 

which gives the required number of values. 

To find the limits of the zone for structures 
other than simple cubic, we must draw in k- 
space the lines for which first-order Bragg re¬ 
flections occur. This will enclose the required 
zone. This may be seen as follows: Fig. 52 shows 
a series of crystal planes, and it is supposed that 

„ a wave-vector k in the direction shown is re- 
Fig. 52. Showing Bragg 

reflection from crystal fleeted by them. Then, by the Bragg condition 
P lanes - for reflection, in the wave function (12) the sign 

of the coefficient of the wave function <^ k will change in going from one 
atom to the next. It is clear that this represents the extreme form of the 
wave function, in the sense that a larger numerical value of k in the 




ky 


(a) 




§38 


ELECTRONIC MOTION IN CRYSTALS 


213 


same direction would lead to a repetition of wave functions already 
described by smaller values. 

We shall illustrate this principle by considering the s Brillouin zone 
for close-packed atoms in a plane (Fig. 51 6). There are then three sets 
of planes, at angles of 120° to each other, and spaced V3a/2 apart. 
The first Brillouin zone is thus a hexagon of side 1/V3a and volume 
V3/2a 2 . Since the number of atoms per unit area is V3/2a 2 , it will be 
seen that we have in the zone a number of states equal to the number 
of atoms. 

For 2 ei{k?l) we obtain 

2 c t( k P/) ^ 2 cos k r a + 4 cos \k x a cos k V3 lc y a . 

It will be seen from formulae (13) that near the bottom of any zone 

W ~ const. + ak 2 . 

We may write ol = fa 2 /2m *] 

m* will then be called the ‘effective mass’ of an electron moving in 
the crystal lattice. For energies near the bottom of a band, an electron 
moving in a periodic lattice will behave exactly as a particle in free 
spacef with charge e and mass m*. The approximation is often made 
of replacing m* by m. 



.Fig. 53. p wave function of lowest energy for metallic 
lithium; the dots represent adjacent atoms. 

In addition to the 5-zones already described, p- or d-zones can be 
formed by a similar process. They will not be discussed in any detail 
here. Fig. 53 shows the general appearance of the lowest p wave 
function for lithium. The symmetry of such a wave function is such 
that it will be described by k x = 77 /a rather than k — 0 . 

Although the energy corresponding to this wave function will in 
general be higher than that of the state (k x ~ 7 r/a, k y — k z = 0) in the 
5-zone, it will not necessarily be higher than all states in the 5-zone. 
Thus the energy band of the 5-zone may overlap the energy band of 
the ^-zone. If one plots against energy W the number N(W)dW of 
states in the energy range dW one gets a curve such as that of Fig. 54a. 

f It is assumed that the lattice is rigid, i.e. that the atoms are not vibrating. Under 
these conditions the mean free path is infinite. In practice an electron will move with a 
finite free path, as shown in § 40.7. 
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39. Conductors and insulators 

In metals and also non-metals such as diamond one can treat each 
electron as moving in the field of the positive ions and the field of all 
the other electrons, averaged in the sense of the Hartree or Fock 
approximation (Chaps. V, VI). This field is, of course, periodic, so that 
the wave functions are of the type described in the last section. 





I N(W) 



(c) (d) 

Fig. 54. Density of states in solids; N( W) d W gives the number 
of states with energies between W and IF -f- dW. (a) Monovalent 
metal; (b) divalent metal ; (c) insulator; (d) graphite. 


Only two electrons are allowed to be in each state specified by a 
quantized value of k. We have seen that each 5-zone contains 2 N 
states. Therefore, for monovalent metals (sodium, silver, etc.), the first 
zone will be half full. This is shown in Fig. 54, where curves plotting 
N(W) against W are shown. 

If a zone is half full, it is clearly possible for the solid to carry a 
current, because within the zone a state can exist in which there are 
more electrons moving in one direction than the other. It is clear, 
therefore, why atoms such as Cu, Ag, Au, Na, K, etc., form metallic, 
conducting lattices. 

On the other hand, if a zone is fully occupied, the electrons of the 
zone cannot contribute to the current because exactly as many are in 
states with wave-vector k in one direction as in the other. Insulators, 
therefore, are materials in which each zone is either full or empty 
(Fig. 54 c). An important intermediate case (graphite)f is sketched in 
Fig. 54 d. 

The fact that atoms with two electrons in the outer ring form con¬ 
ducting lattices (except helium) shows that for these materials the 5 -zone 
must overlap the p-zone, as shown in Fig. 54 b . 

t P. R. Wallace, Phys. Rev. 71, 622 (1947). 
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It must be emphasized that the model used here, in which each 
electron is supposed to move in the averaged field of all the other 
electrons, is of course an approximation. What is assumed is that, 
if an antisymmetrical determinant of the type of Chapter VI, equation 
(11) is set up, in which the orbital wave functions are of the form 
e i{kr) u k ( r), then this will be a useful approximation of zero order to the 
true wave function. It is very difficult to give convincing reasons why 
this should be the case for metals. 

For further discussions of the validity of the collective electron 
model, and for attempts to improve it, cf. papers by Schubin and 
Wonsowski,f and by Bethe.J 

40. Applications of the collective electron model to metals 

In this section an outline only can be given of some of the applica¬ 
tions of the collective electron method to the properties of metals: 

40.1. Emission and absorption of soft X-rays 

If any metal is used as an anticathode in an X-ray tube, part of the 
emitted radiation will correspond to transitions from the conduction 
band (the occupied states) into a lower X-ray level. If the mean energy 



Fia. 55. L. a emission bands of sodium and magnesium. 


hv of the emitted radiation is not too great compared with the width 
of the band of energies of the occupied states, the form of the emitted 
band can be determined experimentally. Fig. 55 shows the form of 
the L s band for sodium and magnesium.§ From the theoretical point 
of view, we expect the emitted intensity to be proportional to 

N{W)p(W), (14) 

where p(W) is the transition probability from a state of energy W into 
the X-ray level. The form of the band shown in Fig. 555 has the 
characteristics that we expect for a divalent metal, namely two over¬ 
lapping zones and a sharp cut-off for high energies. 

t S. Schubin and S. Wonsowski, Proc. Roy. Soc . A, 145, 159 (1934). 

t H. Be the, Zeits.f . Physify 71, 205 (1931). 

§ From H. W. B. Skinner, Rep. Progress in Physics , 5, 271 (1938); see also H. M. 
O’Bryan and H. W. B. Skinner, Proc. Roy. Soc. A, 176, 229 (1940). 
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In the same way the absorption intensity may be expected to be 
given by (14), where N( W) refers to the empty levels. 

For a review of the emission and absorption of soft X-rays, and their 
interpretation in terms of theory, cf. Skinner, loc. cit. 


40.2. Specific heats 

The electrons in a metal are subject to the Fermi-Dirac distribution 
law. According to this law, the probability that at temperature T a 
state with energy W is occupied is 

f(W) = ( 15 ) 


It means that if W is much less than £, i.e. if £— W kT , the probability 
is practically unity; while, if W— £ IcT , the probability is approxi¬ 
mately 


The parameter £ is given by the equation 

oo 

j N(W)f(W) dW — total number of electrons. 

6 

£ differs only slightly from the energy of the highest occupied state at 
the absolute zero. f(W) is illustrated in Fig. 56. 



Fig. 56. Fermi distribution law for electrons in a metal. 


The energy of the electrons at temperature T is 

J N{W)j(W)WdW. 

This differs from the value at absolute zero by a term of order 
N(t)(kT ) 2 , since the number of excited electrons will be of order N(£)kT 
and they will each have energy of order kT . The contribution of the 
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electrons to the specific heat is thus linear in the temperature. A calcu¬ 
lation of its magnitude givesf 

|7t 2 F7W(£). 

If the calculations refer to a gramme-atom, N(Q is of order 

where N is the number of free electrons per gramme-atom. Assuming 
this to be of the same order as the number of atoms, we see that the 
specific heat is of order 



Since £ is equal to several electron volts, the electronic specific heat 
at room temperature will be of the order of 1 per cent, of that due to 
the vibrations of the atoms. 

At low temperatures, however, where the contribution of the lattice 
vibrations varies as T 3 , the contribution of the electrons is the pre¬ 
dominant term, and at the temperatures of liquid helium has been 
observed for a number of metals. Expressing the linear term per 
gramme-atom as yUT , y has the following values (X 10~ 4 ): 

Cut Al§ Ni|| Feft Pdtt Pt* 

0-888 1-742 8-742 6-0 16 80 

The value for aluminium holds only down to the transition point to 
the superconductive state, 1*13° K. 

The high values for the transition metals will be noted. These are 
associated with a partly filled band formed from the ri-states of the free 
atom.§§ 

40.3. Paramagnetism 

As for the specific heat, one can argue that the number of free 
electrons (free to change their spins without infringing the exclusion 
principle) is lcTN(l) and that each of these contributes fj^jkT to the 
susceptibility (/x = eh/2mc). An exact calculation gives for the suscep- 
tibility VW«). 

t Cf. N. F. Mott and H. Jones, Theory of the. Properties of Metals and Alloys , p. 178 
(Oxford, 1936). 

t J. A. Kok and W. H. Keesom, Physica, 3, 1035 (1936). 

§ Ibid. 4, 835 (1937). 

|| W. H. Keesom and C. W. Clark, ibid. 2, 513 (1935). 
tt W. H. Keesom and B. Kurrelmeyer, ibid. 6, 364 (1939). 
tt G. L. Pickard, Nature , 138, 123 (1936). 

§§ Mott and Jones, loc. cit., chap. vi. * 
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One therefore expects a correlation between the electronic specific heat 
and the susceptibility. As shown above, the paramagnetic metals Pd 
and Pt do in fact have large electronic specific heats; but the values 
of N(£) determined from the specific heats are 2-4 times smaller than 
those from the paramagnetism. It is known that for ferromagnetic 
materials an interaction between the electrons exists which leads them 
to set themselves with their spins parallel; for Pd and Pt a force of 
this type probably exists also, not strong enough to lead to ferro¬ 
magnetism but enough to increase the paramagnetism. 

40.4. Thermionic emission 

An equation for the current emitted from a metal surface at tem¬ 
perature T under fields strong enough to give saturation can be derived 
as follows, it being assumed that the effect of Bragg reflection (i.e. of 
the zone structure) of the electrons in the metal can be neglected. 

Within the metal, the number of electrons with moments between 
p x , p x +dp x , etc., and in the volume element dx , dy, dz is given by the 
Fermi-Dirac law 

2 dxdydz dp x dp y dp z I r __ p 2 
h z etw-OfkT+i \ 2m 

For electrons with energy great enough to leave the metal, this 
becomes 2 

p dxdydz dp x dp y dp z S~ W)ikT . 

Let the surface of the metal be the i/z-plane. Then we have to con¬ 
sider only the motion perpendicular to this plane. Since 
00 

J e ~p'i 2 mkT dp _ ^(2 TrmkT), 

— 00 

it follows that the number of electrons per unit area of surface in a 
layer of thickness dx and with p x in the range dp x is 

e^T e -iPlImkT dpx dx _ 

Let £+<£ be the minimum energy that an electron must have in order 
to leave the metal, and let r be the reflection coefficient for an electron 
with momentum p x . Then the number leaving the surface per unit 
area per unit time is 

^W e ilkT J (l-r)i^e-iPil mkT dp x . 
l+4> M 
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On the assumption that r ~ 0 (probable in view of the existence of the 
image force) this gives for the current per cm. 2 

AT 2 e-ti kT , 

where A = 477 nnkPe/h? ~ 120 amp./cm. 2 

For a discussion of the value of A for different models for the surface, 
cf. R. H. Fowler, Statistical Mechanics , 2nd ed. (Cambridge, 1936), 
p. 349, and for a discussion of the validity of the formula, cf. F. Seitz, 
Theory of Solids, p. 165. 


zero 



Fig. 57. Showing potential energy of an electron at the boundary of a metal. 

- In the absence of a field. 

-With a field. 

In fairly strong fields the work function is reduced; this effect is 
known as Schottkyf emission. This is because the field on an electron 
outside the metal at a distance x from the surface is that due to the 
image force —e/4 x 2 . Outside the surface, then, the potential energy 
of an electron is —e 2 /4a?. When the field F is introduced an additional 
term ~eFx is added to the potential energy. The function 

—eFx—e 2 l 4tx 

has its maximum at x = ^(e/F), and has there the value — cJF*. 
Thus the effective work function </>' is 

f - 

This is shown in Fig. 57. 

The current I F for a field F should thus satisfy the relation 
T(log 4-log J 0 ) el 

F* ~ h ’ 

This relation has been shown to be valid over a remarkably wide range 
of temperatures and for fields up to 10 6 volt/cm. by de Bruyne.J 

40.5. Cold emission 

For very strong fields (of the order 10 7 volt/cm.) emission from an 
electrode into a vacuum will take place by tunnel effect through the 

t W. Schottky, Zeits.f. Physik, 18, 63 (1923). 
t N. A. da Bruyne, Proc. Roy. Soc. A, 120, 423 (1928). 
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potential barrier ABC of Fig. 57. A current is thus obtained which is 
independent of temperature. 

The theory of the effect is as follows. Let us denote by D(p x ) the 
probability that an electron with momentum p x in the ^-direction will 
pass through the barrier. Then the number of electrons leaving unit 
area per unit time is 9 r 

£3 J ~ D iPx) d Pxdpydp s , 

where the integration is over all values of p such that 

Pl+Pl+Pl < 2m£. 

Setting 2= p 2 , and writing p x — p—9, it is easily seen that this 


£ J D(p-6)ddd, 


where the upper limit of the integral may be taken to be infinity, since 
D decreases rapidly with increasing 0. 

The transparency factor D will be of the form Aer$, where A is a factor 
of the order unity, and j3 may be obtained from the WKB approximation 
(Chap. I, §3.3) as follows: 

J3 == 2 k J V(F- W) dx (k = 2 mjh 2 ) 

(X-pleF 

= 2 K J V (eF x ) dx (2m£ — p 2 ) 

- Mx-WIF- 

For values of p x in the neighbourhood of p we have 

O __ 1 | ^ 0p . \ 




+••• • 


Putting A ~ 1, this gives for the number of electrons emitted per unit 
area per unit time 2 m 

27 Ttl<f) 

If the approximation A ~ 1 is not made, a result is obtained which 
does not differ from this appreciably*!* if <f> ~ 

40*6. Metallic cohesion 

No detailed account will be given in this book of the theory of metallic 
cohesion. In the treatments given up till now the energy of the metal 
has been divided up into the following parts: 

(a) The interaction of the conduction electrons with each other and 
with the metallic ions, and the electrostatic interaction of the 
latter with each other. 

f Cf. R. H. Fowler, loc. cit. 
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(6) The repulsion which comes into play when the ions overlap. 

In copper or silver ( b ) is important; the ions appear to be in contact, 
so that the force between them is the determining factor for the com¬ 
pressibility; for the alkali metals, on the other hand, the ions do not 
overlap. For metals such as tungsten it may not be possible to make 
a distinction between ions and conduction electrons. 



Em. f»8. Density of electronic states, (a) face-centred, 

(b) body-centred cubic. 

in considering the force (a), it is of importance that the ‘atomic 
sphere 5 surrounding each atom is electrically neutral; thus in calculat¬ 
ing the potential energy of the system one has only to consider the 
interaction of the electrons within eacli sphere with each other and with 
the ion. This interaction is mainly responsible for the cohesion of the 
metal. 

For metals with more than one electron per atom the kinetic energy 
of the electrons will be much more sensitive to the crystal structure 
than for metals with one electron. This is because of the zone structure 
explained in an earlier section. Fig. 58 shows the density of states. 
N(W)dW is the number of electrons with energies between W and 
W-\-dW for the conduction electrons in the <s-zone in a metal with body- 
and face-centred cubic structures, and the same atomic volume; the 
calculations are due to H. Jones.f It will be seen that for metals with 
only one electron per atom there is little difference between the energies, 
but for larger numbers the body-centred structure lias the lower total 
energy. On the basis of these ideas Jones has shown why in alloys of 
CuZn, CuAl, CuSn, etc., transitions from the face- to the body-centred 
structure and other similar transitions occur at a nearly constant ratio 

| H. Jones, Proc. Phya. Soc. 49, 243 (1937). 
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of free electrons to atoms (Hume-Rothery rule). For these two parti¬ 
cular structures it is not difficult to see why this should be so. For the 
face-centred structure reflection occurs first for the (111) set of planes, 
and thus for a wave-number h equal to V3 c?/7t. For the body-centred 
cubic, on the other hand, where the longest wave-length for the Bragg 
reflection is given by the (110) fines, h is ^2djn. In these formulae d is 
the lattice parameter in either case. If a 3 is the atomic volume, then 
since the two structures contain 4 and 2 atoms respectively in the unit 
cube, these wave numbers are in the ratio 

V3a/2* and V2a, 

i.e. l-37a and l-41a. 


40.7. Electrical conductivity 

The wave mechanical theory of electrical conduction in metals is 
based on the following deductions from the theory: 

(a.) Free electrons will go through a perfect crystal lattice for an 
infinite distance without being scattered. 

(j8) Scattering, giving a finite mean free path, can be due to im¬ 
purities or foreign atoms in solid solution, thermal vibrations, 
crystal boundaries, or any other disturbance to the periodicity 
of the lattice. 

(y) In the scattering process an electron gains or loses an energy of 
the order kT and thus small compared with £ (the Fermi energy); 
the only electrons which can make collisions are those with kinetic 
energy approximately equal to £. 

Calculations of the conductivity due to heat motion at low tem¬ 
peratures (T < 0, where 0 is the Debye temperature) will not be 
considered here; if T > 0 for pure metals, and also for the resistance 
due to impurities, it is possible to express the conductivity of a metal 
in the following way (Mott and Jones, loc. cit., p. 258 et seq.): 

The conductivity a is given by 


or == ne 2 r/m , 

where n is the number of free electrons per unit volume and t , the time 
of relaxation, is given by Ifr = vNA, 

where N is the number of atoms per unit volume, v the maximum 
velocity of the Fermi distribution, and A an effective collision area 
given by 


A — j (1—cos 6)1(6) dco, 
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where 1(0) is the cross-section for scattering into the solid angle da) by 
each atomf (in an alloy the mean probability). 

From this formula we can deduce that for T > 0 the resistance of 
a pure metal is proportional to T ; for if x is the displacement of any 
one atom from its mean position, it is clear that the scattered amplitude 
is proportional to x , and hence that 1(6) is proportional to x 2 . The 
mean value of x 2 , however, is proportional to potential energy of all 
the vibrating atoms and thus to half the total heat content. This in its 
turn is proportional to T. 

We may deduce also an instructive formula due to NordheimJ on 
the resistance of alloy systems such as AgAu which form a continuous 
range of solid solutions. Suppose that the two types of atom are 
denoted by A and B and that V A (r ), V B (r) denote the potential energies 
of an electron in an atom of either type. If the number of atoms of 
the two types in the alloy are in the ratio c to 1—c, then the mean 
potential energy in all atoms is 

cV A (r)+(l-c)V B (r). 

In A atoms the potential differs from this by 

(1 -cWb-Va) 

and in B atoms by c (Va~~^b)* 

The intensities of the waves scattered from atoms of the two types are 
thus proportional to (1—c) 2 c 2 

and hence the total scattering (resistance) is proportional to 

C(l—C)2+(1_C)C2, 

which reduces to c(l—c). (17) 

Equation (17), therefore, shows the way in which the residual resistance 
of an alloy varies with composition. Good agreement with experiment 
is obtained for alloys such as PdPt, AgAu. 

41. Electronic motion in non-metals 

In this section a summary will be given of the electronic behaviour, 
from the theoretical viewpoint, of ionic and other crystals which are 
insulators at low temperatures and if not exposed to light which they 
absorb. 

Such crystals have a full zone of energy levels separated by a finite 
energy gap from an empty zone (the ‘conduction band’). If an extra 

t Cf. chap. ix. 

X L. Nordheim, Ann. der Physik, 9, 607 (1931). 
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electron is introduced into the conduction band, it will move freely 
through the lattice. Also if an electron is removed from a full band, 
the resulting vacancy, called a ‘positive hole’, can carry current. 
A ‘ positive hole’ behaves like an electron of positive charge; the analogy 
with Dirac’s theory of the positive electron will be noted. 

Materials which conduct through the motions of electrons in a 
normally full band give a Hall coefficient*)* of sign opposite to that 
obtained in materials where the current is due to positive holes; this 
is the most important way of distinguishing them. 

If a material contains per unit volume n electrons (or positive holes) 
of effective mass m, its conductivity is 

ne 2 

a = — T , 
m 

where t is the time of relaxation. Calculations of t for ionic crystals 
have been made;J since in ionic crystals electrons will have energies 
of the order kT (0-025 eV.) instead of about 5eV. as for metals, the 
changes in the potential due to atomic vibrations will no longer be 
small compared with the kinetic energy of the electron. It is rather 
doubtful therefore whether the perturbation theory on which these 
calculations are based is a good approximation. In any case one expects 
a mean free path of the order of a few multiples of the interatomic 
distance, except perhaps at low temperatures. Thus with v ~ 10 7 
cm./sec., r ~ 10~ 14 sec. 

.Electrons can be introduced into the conduction band in the following 
ways: 

(a) Excitation from the full band, either through the absorption of 
a quantum of radiation (photo-conductivity) or through the 
passage of a fast charged particle. 

(b) Excitation by thermal energy of an electron in an ‘impurity 
centre’ (see below). This occurs in semi-conductors. 

(c) An electron can enter at the boundary of a crystal, e.g. from a 
metal electrode. 

41.1. Semi-conductors 

In most semi-conductors the conductivity would be low for a pure 
crystal. The conductivity has its origin in the presence of a number 

f The phenomenon known as the Hall effect is the following: If a current j flows in a 
conductor perpendicular to a magnetic field H, an e.m.f. E is observed perpendicular to 
j and H, The Hall constant is defined as EjjH. It is easy to see that eHfi/c = eE, where 

the velocity of drift in the direction of the current, is jjen, n is the number of carriers 
per cm.® Thus EjjH — 1 /me. 

t H. Frdhlich and N. F. Mott, Proc. Roy. Soc. A, 171, 496 (1939). 
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of ‘impurity centres’; for these the energy Q required to remove an 
electron into the conduction band is low enough, at the temperature 
considered, for a sufficient number of electrons to be free to give a 
significant contribution to the conductivity. 

According to the theory originally given by A. H. Wilson,f if A is 
the number of impurity centres per unit volume, the number of free 
electrons (in the conduction band) is, per unit volume, about 

3X10 

Various types of impurity centres are believed to exist, as follows: 

(а) In materials such as zinc oxide a small excess (say, one part in 
10 4 ) of zinc can exist. The zinc ion can be lodged interstitially in the 
lattice. Round such an ion the force on an electron in the conduction 
band is e 2 //cr 2 5 where k is the dielectric constant. It can thus be shown 
that there will exist, just below the level of the conduction band, a 
series of stationary states leading up to a series limit, just as for a free 
atom; but that, for the ^-states, at any rate for which the field near 
the centre does not matter much, the order of magnitude of the energies, 
measured from the bottom of the conduction band as zero, is 

—me 4 /2S 2 A 2 , (18) 

or smaller by a factor k 2 than for a free atom. Also the radial extent 
of the wave function is of order 

n 2 h 2 K/me 2 . 

These results will apply, as regards order of magnitude, for the ground 
state (n — 1), provided that the free atom is too large to go into the 
interstitial space. Since k ~ 10 for most semi-conductors, it will be 
seen: 

(i) that the energies required to release an electron may well be as 
small as 0-1 eV.; 

(ii) that the radial extent of the dissolved atom may be of the order 
10-30 A, and interaction between the centres may be expected 
at quite low concentrations. This may account for the known 
fact that Q often depends on concentration. 

(б) Excess metal may also be taken up in an ionic lattice through 
the presence of points in the lattice from which a negative ion is missing. 
The field round such a point will be again of the form e 2 //cr 2 , an electron 
can be trapped there, and the same arguments about small energy and 

t A. H. Wilson, ibid. A, 133, 458 (1931); Tfieory of Metals, p. 65 (Cambridge, 1936); 
Semi-conductors and Metals (Cambridge, 1939). 

5072 q 
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large radius will apply. The ‘i^-centres’ or colour centres which can 
be introduced into alkali halides by heating in metal vapour are of 
this type. 

(c) If an oxide contains excess oxygen, this will be taken up through 
the presence of lattice points from which a metal ion is missing, the oxygen 
ion being too big for an interstitial position. To keep the substance 
neutral, an equal number of positive holes must be present, which will 
be trapped in the neighbourhood of these points. As the temperature 
is raised, some will become free, and conduction becomes possible 
through the motion of positive holes. This is the mechanism responsible 
for conduction in Cu 2 0 with excess oxygen. 

( d ) In Se, Si, SiC the mechanism is not understood in detail. One 
can, however, say that the presence of atoms with an electronic struc¬ 
ture which is not a closed shell in numbers greater than that demanded 
by the crystal structure should give electronic (excess) conduction, and 
the presence of vacant lattice points should give defect conduction 
(conduction by positive holes). The higher the dielectric constant, the 
lower will be Q , other things being equal. 

41.2. Optical properties of impure crystals 

Any dissolved atom (i.e. electron trapped in the field of an interstitial 
ion or place where a negative ion is missing) should have in principle 
a line absorption spectrum leading up to a series limit. This is because 
the electron moves in a Coulomb field superimposed on the periodic 
field of the lattice. In practice these absorption lines will be greatly 
broadened by the vibration of the crystal. 

The well-known jF-band which accounts for the colour of alkali 
halides with excess metal is probably an s-p transition of the trapped 
electron. 

Absorption of light in the band due to an impurity, therefore, will 
not lead directly to photo-conduction; nevertheless photo-conductivity 
is often observed, for instance, in alkali and silver halides. This is 
probably because the energy interval between the excited state of the 
centre and the bottom of the conduction band is so small that heat 
motion enables the electron to escape before it falls back to the ground 
state, except at very low temperatures. 

41.3. Photo-conductivity 

One must distinguish between photo-conductivity of a semi-conduc¬ 
tor, where the absorption of light merely increases the conductivity 
already present, and the primary photo-current in insulators (alkali 
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halides with /'-centres, silver halides at low temperatures, etc.). The 
latter lias been investigated by R. W. Pohl and his school.f In this 
work a crystal is placed between electrodes, given a short illumination, 
and the quantity nex measured, where n is the number of electrons 
released (usually equal to the number of quanta absorbed) and x the 



i i 


|< oc *-{ 

Fig. 59. Path of an electron released by light in 
an alkali halide crystal. 

distance drifted by each electron before it is 'trapped’. This trapping 
is a marked feature of insulating crystals; an electron in the conduction 
band survives only a certain time before being trapped. The energy 
necessary to release an electron from a trap is, of course, less than from 
one of the original impurity centres, x , the distance drifted, will be 
proportional to the field F and inversely to the number of traps. 

Other examples of trapping are: 

(a) In zinc sulphide phosphors, where illumination at low tempera¬ 
tures gives rise to trapped electrons which return to their original 
centres with emission of radiation only after warming up. 

(b) The action of light on photographic emulsion, where the first 
process appears to be the release by light of an electron which 
is trapped at a point where the latent image will be formed. 

41.4. Optical properties of pure crystals; the exciton 

In this section we discuss the optical absorption of a pure non- 
metallic crystal, that is to say, the absorption spectrum due to the atom 
or ions that form the crystal lattice. It should be emphasized that any 
allowed transitions of this type will lead to a very strong absorption 
coefficient, of the order 10 6 cm.- 1 

t R. W. Pohl, Proc. Phys, Roc. 49 (extra part), 3 (1937); F. Seitz, Rev, Mod. Phys. 
18, 384 (1946); N. F. Mott and R, W. Gurney, Electronic Processes in Ionic Crystals , 
chap, iv (Oxford, 1940). 
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In Fig. 60, if a quantum of frequency hv 0 is absorbed, a free electron 
and a positive hole are produced; both can move in the lattice inde¬ 
pendently, giving (in theory) photo-conduction. The frequency hv Q is 
not, however, the lowest that can be absorbed, because it is possible 
for the electron and positive hole to exist in a series of states in which 
they are bound in one another’s field. If r is the distance between them 



** x o * Direction of 

Blocking direction large current 


(c) (d) 

Fig. 60. Potential energy of electron at metal/non-motal interface: (a) insulator; (6) 
semi-conductor in equilibrium; ( c ) field in blocking direction; (d) field in opposite 
direction. 

and R the coordinate of the centre of gravity, we shall expect these 
bound states to be of the form 

^ kR V,(r), 

where ift n (r) are wave functions similar to those for a hydrogen atom. 
An electron and positive hole bound together in this way has been 
termed by Frenkelf an exciton; the formulation given here is due to 

Wannier.J 

If an exciton is formed by the absorption of a quantum of light, it 
will be formed in a state for which k = 0; therefore, owing to the 
Coulomb field between a positive hole and an electron, we expect the 
absorption spectrum to consist of a series of lines leading up to a series 
limit. In practice the lines will be broadened by heat motion. 

Attempts have been made to account for the absorption spectrum 
of alkali halides along these lines. 

t J* Frenkel, Phys. Rev. 37, 17 (1931). 
j G. H. Wannier, ibid. 52, 191 (1937). 
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An alternative description of the exciton may be given, which corre¬ 
sponds more closely to that of Frenkel’s original paper. In this we 
consider a crystalline array of atoms, numbered 1, 2,..., r,..., n. Let 
fa{x) denote the wave function of an electron in the ground state of 
the atom r, and fa r (x) the corresponding wave function for some given 
excited state. Let x F r (# 1 ,...,a;„) be the wave function describing the 
whole system of n atoms when all the atoms are in the ground state 
except atom r; this can be formed by setting up an antisyminetrieal 
determinant from functions of the type 

<Pl( X l)<p2 («s)-0r(*r)-0»(*«)- 

Then it can be shown that T r is not a stationary state of the system, 
but that the stationary states are of the form 

r 

where R r is the position of the rth atom. A wave function of this type 
represents the place where the atom is excited, moving about the lattice 
with wave-number k. In other words, an excited atom hands over its 
energy to a neighbour, the process is repeated, and so on. The place 
where the excitation is at any moment is called the exciton. 

41.5. Contact between a metal and an insulator 

If a metal is in contact with an insulator, we may introduce a ‘work 
function’ fa which represents the work necessary to take an electron 
from the metal into the conduction band of the insulator (Fig. (50). 
fa will not, of course, be equal to the work function cf> which is defined 
as the work necessary to remove an electron from the metal to vacuum 
across a clean surface. Experimental evidence indicates that usually 
fa is less than <£. From the theoretical point of view the image force 
is |e 2 //cr 2 instead of \e 2 \r 2 , so the contribution from that term to the 
work function will be very much smaller. 

Given two metals of work functions (f>, fa against a vacuum and fa, 
fai against a given insulator, it is tempting to assume that 

(f)—fa = fa—fai, 

but it is doubtful if this is a good approximation, and arguments can 
be given to suggest that <f> ( —fai has the same sign as <f>—fa , but is 
considerably smaller. The field at the boundary of a metal will be 
changed if another solid is brought into contact with it. 

It will be assumed that fa is always positive. 

A non-metal with no electrons normally in the conduction band will 
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act as an insulator in contact with a metal under the following 
conditions: 

(а) <f> { is too big for thermionic emission. 

(б) The applied field is not great enough for strong field emission. 

It is possible that another factor in the action of insulators is the 
presence of‘traps’, so that any electrons which get into the conduction 
band set up a space charge which repels other electrons. 

41.6. Contact between a metal and a semi-conductor 

If the non-metal considered above has impurity centres (e.g. dis¬ 
solved atoms) which can lose electrons to the conduction band for the 
expenditure of energy Q less than <f> v it is clear that the centres near 
to the metal will lose electrons to it. 

A positive space charge is thus set up in the semi-conductor, and an 
equal negative charge induced on the metal. The potential energy V(x) 
of an electron in the semi-conductor is thus as shown in Fig. 60(6). 
It is shown in Mott and Gurney, op. cit. Chap. VI, that <£', the contact 
potential between metal and semi-conductor, is equal to <f>i—\Q at low 
temperatures. 

The shape of the barrier may easily be calculated in the practically 
important case where Q In that case we may assume that all 

centres are empty up to the point A in Fig. 60(6); thus if N is the 
number of centres per unit volume, V{x) satisfies Poisson’s equation 

sas - 4 m 

dx 2 

At the point A , we may assume, since Q <^ that dV/dx = 0. Thus 
if A is a distance x 0 from the metal, V(x) is given by 

V{x) — 27rNe 2 (x 2 —2xx 0 ). (19) 

Putting x = x 0 in this equation and neglecting the difference between 
fa, </>’ we find ^ = 2 n Neh& 

which gives the width of the barrier layer. In practical cases 
x 0 ~ 10~ 6 cm. 

The barrier between a metal and semi-conductor may be expected 
to have strong rectifying properties, if the resistance is an appreciable 
part of the resistance of the whole circuit. The theory indicates that 
the direction of high resistance is for motion of electrons from metal to 
semi-conductor (it will be the opposite for defect conductors). The 
reason is made clear in Figs. 60 (c) and (d). Electrons are supposed to 
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go over the barrier with the help of heat motion (no tunnel effect); 
a potential difference equal to fa will remove the barrier preventing 
the motion of electrons from semi-conductor to metal, but does not 
affect the height of the barrier opposing motion in the opposite 
direction. 

The theory of the effect is as follows: 

Pigs. 60 (c) and (d) show that, if a potential VJe is placed across the 
barrier, V(x) is still given by (19) with 

fa+V 0 = 2ttN e 2 #§. 


The equation governing the current j crossing unit area of the barrier 


per unit time is 


j = —vn(x) 


dV(x) dn(x) 


dx 


dx 


Here n(x) is the density of electrons at a distance x from the metal, 
v and D are the mobility and diffusion coefficient of an electron. Since 


this may be written 

j 


Djv = JcTje, 


—vkT 


n dV dri 
kT dx dx 


This can be integrated by multiplying both sides by e vlkT ; we obtain 


X 

j J e V{x)Ikr dx = vkT[n(i))—n(x )]e v ^ kT . 
o 

We now take a value (#j) of x beyond the barrier; we find 

*1 

j — j e v ^ kT dx == vkT(n 0 —n 1 )e~^ Vo+< l > ^ kT , 

o 

where n x is the density of electrons far from the barrier. 
We note: 

(а) n x v = a, the conductivity of the semi-conductor. 

(б) Substituting from (19) for V(x)y we find 

Xi 00 

J e V(x)lkTfl x ~ j* e -inNe*xxolkT d x 

0 0 

— kT/47rNe 2 x 0 . 

(c) At points where V 0 — 0, j = 0. Hence n 0 = n x 
Hence, finally, 


j = 2 uV{2t TNe\Vz+fa}e-W kT {l-e-W kT }. 
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A barrier of this type seems first to have been described by Schottky.f 
For such barriers tunnel effect is probably unimportant, though for 
high voltages in the blocking direction there may be some field emission 
from the metal to the semi-conductor. 

When thin 10~ 7 cm.) insulating layers exist on the surface of 
a semi-conductor (as, for instance, oxide layers on silicon and SiC 
crystals), tunnel effect may play an important part in rectification4 

f W. Schottky, Zeits.f. Phyttih , 113, 367 (1936); ibid. 117, 574 (1941); see also review 
by J. Joffe, Electrical Communication , 22, 217 (1945). 

I C. Dilworth, Proc. Phyts . Soc. 60, 315 (1948). 
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IX 

COLLISION PROBLEMS 


The problems to be considered in this chapter are of the following type:)* 
a beam of particles, for example, electrons, is incident on an atom or 
atomic nucleus or other scattering centre; it is recjuircd to find the 

number of particles that are scattered out _ 

of the beam in a definite direction. For this -> 

purpose we define the scattering cross-section ** 

1(6) as follows: if the beam is such that v 
particles cross unit area per unit time, then 
the number of particles scattered per unit 
time through an angle 6 into a solid angle Flo 61 Showing 8(;attoring 
da) is vl(0)d(D . It will be noticed that 1(6) of particles thro ugh an anglo 0 
has the dimensions of an area; the scattering mto a solld angl ° dw ' 
is the same as it would be if the particles had to hit an area 1(6) dm 
in order to be scattered into the solid angle daj. 



71 

The quantity J 1(6) dm = J I(6)27t sin 6 dd 


is called the total cross-section for scattering. 

It will easily be seen that if the particles are moving through a 
medium containing N scattering centres per unit volume, the proba¬ 
bility per unit length of path that a particle will be scattered into the 
solid angle dot is j,,,,,*.. 


In the case of electrons incident on atoms, if their energy is greater 
than the first excitation potential, they may be scattered either 
elastically or after losing energy. In such a case we introduce the 
quantity 1 0 (6) to refer to elastic scattering and I n (6) to refer to scatter¬ 
ing after excitation of the atom to the state n; the quantity NI n (6)dm 
thus gives the number of particles scattered per unit time into the solid 
angle dm after raising the atom to the nth excited state. 


t General references for the subject-matter of this chapter are: N. F. Mott and H. S. W. 
Massey, The Theory of Atomic Collisions (Oxford, 1933); J. H. Me Mi lien, ‘Elastic 
Electron Scattering in Gases’, Rev. Mod. Phys. 11, 84 (1939)—a review both of theory 
and experiment on the subject; Handbuch der Physik , 24/1 (Berlin, 1933), article by 
H. Bethe, especially for discussion of stopping power. 
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43. Scattering by a centre of force 

The simplest collision problem is that of a stream of particles incident 
on a centre of force; by this is meant a region of space in which the 
potential energy of a particle is not zero but some function V(r) which 
tends to zero as r tends to infinity. If, for example, V{r) is equal to 
ZZ'e 2 /r, then the problem is that of a beam of particles each with 
charge Z'e incident on an atomic nucleus with charge Ze. To a good 
approximation the scattering of a beam of fast electrons by an atom 
can be treated by taking for the field of the atom a screened Coulomb 
field, such as that for which 

Ze 2 

V(r ) = —exp(— qr). 
r 


The Schrodingcr equation for a particle in such a field is 

W+%{W-V(t)W-~Q 9 ( 1 ) 


where W is positive, being equal to the kinetic energy of an electron 
at a point remote from the atom. Writing 

k 2 — 2mW/h 2 , U(r) = 2 mV(r)/h 2 , 
this takes the form V 2 0+{A; 2 — V(r)}\ft = 0. (1.1) 

To describe the scattering it is necessary to find a solution that will 
have the form of an incident wave and a scattered wave. Thus a solu¬ 
tion must be sought which, for points distant from the scattering centre, 
has the asymptotic form 

i/j ^ e iks -\-r~ 1 e ikr f(9)-\-term^ of order 1/r 2 . (2) 

If this is multiplied by the appropriate time factor e~ 2nivt , it is seen 
that the first term represents a plane wave advancing from left to right 
along the z-axis. This wave describes a beam in which there is one 
particle per unit volume so that the number of particles crossing unit 
area per unit time is v (= hkjm). The second term represents an out¬ 
going wave; in this \f(0)\ 2 jr 2 is equal to the number of particles per 
unit volume at a distance r from the origin, and thus the number per 
unit time incident on an area A is 


v\f(6)\ 2 A/r 2 . 

It follows that the number scattered into the solid angle da> is v\f{6)\ 2 dw 

/(•)= l/W I 2 * (3) 


and hence that 


The scattering is thus completely determined if/(0) is kpown. 
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43.1. Exact solution of the wave equation representing an incident and 
a scattered wave 

In this section we shall show how a solution of equation (1) of type 
(2) can be obtained. We require first an expansion of the plane wave 



e iks in a scries of spherical harmonics. The required expansion is, 
writing z = r cos 6 , 

gtfcrcosf? _ 2 (2Z+l)»'/$(cO8 0)/,(r), (4) 

z=o 

where f,(r) = ^fj l+i (lcr) (4.1) 


and J l+ t is the Bessel function of order For the first of these 

functions / Q ( r ) = (kr^sinkr. 

The asymptotic form of f(r) is, for large r, 

fi( r ) ~ (kr)-hin(kr—%lir). (5) 


Plots of f 0 (r) and / 10 (r) are given in Fig. 62 for Jc = 1. 

A proof of these propositions is given in any text-book on Bessel 
functions*! A proof of enough of them for the purposes of this chapter 
may be obtained as follows: equation (4) may be regarded as an ex¬ 
pansion of the left-hand side in a series of Legendre functions i^(cos 0) 


t For example, G. N; Watson, Bessel Functions . 
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with the functions f t (r) unknown; then multiplying both sides of (4) 
by P t ( cos 0)sin 0 and integrating with respect to 0 from 0 to n, we obtain, 
writing cos 0 = t, 

41 

Mr) = ^ J e ikrt Pi{t)dt. (6) 

-1 

This equation defines /,(r); it also enables an asymptotic expression to 
be obtained; integrating by parts, equation (6) gives 

M r ) = ~[c iftrt i?(<)]j“^+torin8oforderi. 


Thus, since 1*(1) = 1 and P t (— I) = (—1/, we find 

fi(r) = {kr)-Hin(kr-~\lTT), (7) 

which is identical with equation (5). 

Turning now to the equation (1) for a particle in a central field, we 
see that the general solution of this equation with axial symmetry 
must be of the form y 0)*J(r), (8) 

where the A t are arbitrary constants and the function F t is the bounded 
solution of 

This may be written 

-I (rF )+{**- U(r) - = o. (0) 


it follows that if U(r) tends to zero sufficiently rapidly as r tends to 
infinity, then the bounded solution must have the form at infinity 

F t ~ {kr)- l &m(kr— ^ 77 +^). (10) 


The phase r\ x can in general only be determined by numerical integra¬ 
tion of equation (9), though approximate methods of evaluating them 
are given on p. 238; the term \lir is added so that ^ shall vanish if 
U(r) is zero. 

The constants A t in (8) must now be chosen so that it shall have 
the asymptotic form (2); in other words, they must be chosen so that 

2 A l P l (co$0)F l (r)—e iks (11) 

shall represent an outgoing wave only. Substituting from (4) for e iks we 
obtain for (11) 


2 P,( COS B){A,F l (r)-(2l+\)i l Mr)}. 
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Substituting the asymptotic forms of F^f h we have for the term in the 
curly brackets 

(Ar)“ A {ili sin (hr— |/7 t+ t^)—(2 Z+1 )i!sm(kr— Urr)}. 

This may be written 

-i- \{A t e i7 »—(2l +1 )iY kr ~ iM ~{A l (2l+l)i l }e~ ikT+llni ]- 

2 ikr L 

In order that this may represent an outgoing wave only the second 
term must vanish; we set therefore 

(2l+l)i l e^ 9 

and the scattered wave takes the form r~V ,AT /(0) with 

m = ~ 2 ( 2 Z+ m 2ir "~ i )p t ( cos e). (12) 

K z-o 

This formula for the scattered amplitude was first given by Faxen 
and Holtsmark.f Further details are given by Mott and Massey, 
op. cit., Chap. II. 

Certain theorems of importance about the phases rj l will now be 
proved. 

(a) The expression (12) for the intensity of the scattered wave does 
not depend on the assumption of a static field of force, but can be 
deduced generally from the conservation of particles if there is no 
inelastic scattering. The proof is fairly obvious and will not be given 
here. 

(b) For slow particles, of wave-length (ti/mv) large compared with 
the radial extension of F(r), the scattering is spherically symmetrical 
so that only rj 0 needs to be considered. 

(c) As v tends to zero, r) 0 in general tends to zero in such a way that 
the scattered cross-section remains finite. In one particular case, that 
of an attractive field in which the incident particle is capable of being 
held by the scattering field in a bound stationary state of exactly zero 
energy, r) 0 remains finite as v tends to zero. Therefore the scattered 
cross-section given by (12) tends to infinity. 

The proof is as follows: if a bound state of zero energy exists in the 
field of potential F(r), this means that 

d(rF)/dr = 0 

at the boundary of the field. But it follows from this that, as k tends 
to zero, rj 0 tends to t. 

t H. Faxen and J. Holtsmark, Zeits.f. Physik , 45, 307 (1927). 
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(d) Various approximate expressions may be found for the phases rj t . 
In the first place let us write 

9i(r) = krMr), G x (r) = krF t (r ). 

Thus, from equation (9), it follows that 


n d% d 2 G x n TT 

0 'T*- g ‘w =e ‘ mr)g ‘- 


Integrating from zero to r, we find 

r 

= j °i( r ') U ( r ')9i(r')dr\ 

0 

We assume that U(r) tends to zero faster than l/r, so that the integral 
on the right converges. Then, inserting the asymptotic forms (7) and 
(10) for G t and g h we see that 

00 

sinrjj = -jj. J G l (r)g,(r)U(r)dr. (13) 

0 

This formula is exact. An approximation valid when l is large is to 
replace G x by g t so that 

00 

sin 7], ~ - i J [g,{r)fU(r) dr 
0 

oo 

= V(r)\J l+i (kr)frdr. 

0 

If this approximation is used for all l , and sin rj t is put equal to rj l9 we 
obtain for/(0) 

(14) 

0 

where K = 2ifesin|0. 

This follows from the expansion! 

^ = I (2?+1 )i)(cos 6){fi(r )} 2 . 

Kr i^o 

Equation (14) is known as the ‘Bom approximation’ to the scattered 
amplitude; in practice it is seldom a sufficient approximation. Bom’s 
original derivation is given in the next section. 


t G. N. Watson, loc. cit., p. 363, 
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An alternative and improved approximate method of determining 
the phases has been given by Pais,f as follows. A suitable solution of 
equation (9) when there is no interaction is 


9i(r) = (l”kr)tj l+i (lcr). 

Pais takes the effect of the interaction into account by using a solution 
of the type <7,(r) = (frkr)*J l+x+i (kr), 


where A is to be determined from equation (9) by a variation method. 
He is then able to show that 


gj+A +1 v 
21+2X+1 


00 

f? J Vm+niWfrdr. 


0 

If this equation is solved graphically or by a method of successive 
approximations, then the phase is given by 

Vi = —frh- 

HulthenJ lias used a variational method to obtain rj if 


o 

and 0 is a function containing undetermined parameters c, then opti¬ 
mum values will be given by the condition dL/dc = 0. Hulthen takes 

^ or ^ 0(r) = cosr/sin^r-f (1—e~ ; ' r )(l+Ae“ Ar )sin rj cosier, 

with h as an adjustable parameter, in finding the phase r) 0 with a field 

of the type V(r) = — br~ l e~* r used in Meson theory. 

Other methods of obtaining phase shifts will be described in the 
second edition of Mott and Massey, op. cit. (to appear shortly). 

Massey and Buckingham§ have worked out the scattering to be 
expected from a large number of fields which occur in nuclear theory. 


44. Born’s approximation for the scattering of particles by a 

centre of force: alternative treatment 

For the scattering of particles by a centre of force, it is possible, as 
shown in the last section, to obtain an exact solution of the wave 
equation. An important approximate method was first introduced by 
Born,|| in which the field V{r) is treated as a perturbation. This is 
known in the literature as the Born approximation. 

t A. Pais, Proc. Camb. Phil. Soc. 42 , 45 (1946). 

% L. Hulthen, Fyaiogr. Sallsk . Lund. Forhandl. 14 , 21 (1944). 

§ H. S. W. Massey and R. A. Buckingham, Proc . Roy, Soc. A, 163 , 281 (1937). 

|J M. Bom, Zeits.f. Physik, 37, 863 (1926) and 38 , 803 (1926). The former paper is the 
first in which the probability interpretation of 0 was introduced, and is the first dis¬ 
cussion of a collision problem in wave mechanics. 
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We make use of the fact that a solution of the inhomogeneous 
equation Vf+ty = F(x,y,z), 

where F(x } y,z) is a known function, is 

1 /* pik\T-r'\ 

*—hl 

For a proof see, for example, Mott and Massey, op. cit., Chap. VI, 
or any text-book on electromagnetic theory. 

If we write the wave equation in the form (1.1), namely 
V 2 tf,+{Jc 2 -U{r)}i/j = 0, 
and treat U ( r) as small, then we may set 

xjj = e ikz +fa 

and neglect the product Ufa. Substituting for i/j we obtain therefore 

V 2 fa+Jc 2 fa = U(r)e ikz , 

of which a solution is 



It is easily seen that this solution behaves at infinity like an outgoing 
wave; the asymptotic form is in fact 

pikr 

<Pi ~ — /W> 

r 


where f(d) = —— f e- ik{nr)+ikz U(r) dr 

47 t J 

with n = r/r. 

A convenient way of writing this is 

AS) = - ~ J e« k - k - r >F(r) dr, (15) 


where k 0 , k are vectors of magnitude k in the directions of motion of 
the particle before and after scattering. 

It will be noticed that the same formula can be obtained by the 
methods of Chapter X for the probability per unit time that a particle 
makes a transition from the initial state with momentum k 0 and wave 
function e ikz to a final state with momentum k. 

Formula (15) may be simplified (when V(r) is spherically sym¬ 
metrical) by taking polar coordinates with axes in the direction of the 
vector k 0 —k; we thus obtain 


f(6) = ^ J J 4*™*V{r)\2ir sin 6' dd’ r 8 dr, 



§44 


BORN’S APPROXIMATION 


241 


where K — |k 0 —k | = 2k sin \0. On integration with respect to O' this 
gives 

r sin Kr J7 , » « , 

/ W= -_J ___F (r)r 2 dr 

, 0 
as m formula (14). 

Ze 2 

For example, if V(r) = —exp(—gr) (screened Coulomb field), this 


gives 


m 


Ze 2 


- cosec 2 


2mv 2 2 ~ I + to/2*sinifl) a * 

If <//& is small, the scattered intensity tends to 

/ Ze 2 \ 2 


*(«> 


\2 mv 2 ) 


cosec 4 


which, as seen in the next section, is the classical expression for the 
scattered intensity. 


44.1. Scattering by a Coulomb field 

If a beam of charged particles, each carrying a charge Z'c, and such 
that one particle crosses unit area per unit time, falls on a single nucleus 
of infinite mass and charge Ze, then according to Newtonian mechanics 
the number of particles l(0)dcj scattered per unit time through an 
angle 0 into a solid angle dco is given by 

(ZZ'e 2 \ 2 

7W = (^) c08ec4 ^ 

The proof of this, the Rutherford scattering formula, is given in any 
text-book on nuclear physics. 

The same formula is exactly valid in (non-relativistic) wave me¬ 
chanics, provided 

(а) the Coulomb law of interaction, V(r) = ZZ’e^jr, is assumed to 
hold down to all distances; 

(б) the velocity of the particles is small compared with c, the velocity 
of light. 

Classical results are also valid for the collision between two charged 
particles of finite mass, except when both particles are identical (two 
a-particles or two electrons, cf. Chapter V). 

For details of the proof of this the reader is referred to the original 
papers or to Mott and Massey, op. cit., Chap. III. One has to obtain 
a solution of the equation 

T72/ . (7 2 2 me 2 ZZ' l\ , 

w + |fc . w~ rr = 

R 


6072 
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representing an incident and scattered wave. The solution may be 
shown to be ^ = F (-i«; 1; ik(r-z)), 

where F is the hypergeometric function defined by 


and a = ZZ'e 2 /ftv. 

The asymptotic form is «/r = I+Sf(6), 

where I represents an incident wave and is given by 

JT __ giks+i<x\o«k(r-z) 


and 8, the scattered wave, is given by 

$ __ y-lgikr-iotlogkr, 

77’ pi 

and f(6) = —— cosec 2 16 e^ lo ^ 1 - CO80 )+^+ 2l >, 

2 mv 2 

where e 2 i 7 i° = r(l+icx)/r(l— ioc). 

It will be seen that neither the incident nor the scattered waves have 
the normal form; this is due to the slow convergence of the Coulomb 
field. 

On forming the scattered intensity 1(6) = \f(6)\ 2 we see that the 
classical formula of Rutherford is obtained. Planck’s constant only 
arises in the phase of /, which is not observable (except in the collision 
between identical particles; cf. Chapter V). 


44.2. Collisions between electrons and atoms 

Up to this point we have considered only the scattering of electrons 
by a centre of force. We have now to outline a theory of the scattering 
of a beam of electrons by an atom, in which the interaction between 
an incident electron and the electrons in the atom is taken into account. 
In the subsequent analysis we shall discuss the scattering of a beam 
of electrons by a hydrogen atom; the extension to a many-electron 
atom is straightforward. 

We denote the Schrodinger equation for the hydrogen atom by 
{H(p,r)-W}4( r) = 0, 

where H is the usual Hamiltonian operator and r the coordinate of 
the electron. Let W n , ifj n be the characteristic energies and correspond¬ 
ing wave function. Then the Schrodinger equation for an additional 
electron interacting with the atom is 


(16) 
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Here r v r 2 are the coordinates of the tw o electrons and 


P( r i.r 2 ) = --- + 


r,—r a 


The Hamiltonian is, of course, symmetrical in the coordinates of the 
two electrons. 

To obtain an approximate solution, we expand T(r 1 ,r 2 ) in a series 
of characteristic functions i/j n ( r 2 ), 

= 2F n (. ri)<A„(r 2 ). 

n 


Equation (16) then becomes 


2 {-^ V ?+ F(r !- r 2 ) +^- W r )^(ri)-A„(r 2 ) = 0. 


Multiplying by </r*(r 2 ) and integrating over all r 2 

= { «(r 1 )K(r 1 .r,)Y(r 1 ,r 1 )dT 1 . (17) 

If we denote the function on the right by ^ /t (r l ), a solution for F n 
behaving at infinity like an outgoing wave can be obtained, so long 
as W—W n is positive; for if we set 


the solution is 


K 


F„ 


2m(W—W n )/h 2 9 

£tfc n l r i- r al 


1 r e ik * lr i- 

.__j __ 


2 ) ^ r 2> 


which, as shown in the last section, behaves in the required way. 

To obtain explicit expressions for F jn we must make some approxima¬ 
tion for ¥*( 1 *!, r 2 ) in the right-hand side of (17). We require that T(r ly r 2 ) 
should represent an incident wave falling on an atom in the ground 
state; the wave function representing this will be 

e ik ° z ufj 0 (r 2 ). (18) 

Owing to the interaction term scattered waves will also be formed. 
The simplest approximation will be to substitute (18) forT(r 3 ,r 2 ) in 
the right-hand side of (17); this procedure is known as Born’s approxi¬ 
mation. 

Equation (17) thus becomes 

{V*+k*}F n (r) = ^Ur)e«.*, 

Jo»(r) = J <l'n( r i)V(r v r 2 )^ 0 (r a ) dr 2 . 


where 
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This equation is solved just as in the last section, and an asymptotic 
expression obtained; we find 

F n (r) ~ r-W/„(M)> 

where 1,(9, <f>) = - ~ J e'( k o- k - r )F 0 „(r) fa. 

Here k 0 is the wave number of the electron before the collision and 
k n afterwards. 

The obvious interpretation would be that 

A = !M/«(M)I*, 

A o 

I v da) being the cross-section for scattering into the solid angle dcu after 
exciting the atom to the nt\\ state. This interpretation would, how¬ 
ever, neglect the possibility of a ‘rearrangement collision’, i.e. a collision 
in which the incident electron is captured by the atom and ejects 
another one in its place. To account for these we can expand x F(r v r 2 ) 
alternatively in the form 

^( r i>r a ) I O n ( 

and obtain 

{V 2 +Ai}(? 1( (r 2 ) = |"V. 0 (r 2 ) J e^*F(r J) r 2 )^(r 1 )dr 1 , 

whence an expression for G n representing an outgoing wave can be 
obtained as before; then if 

0 H ~ r-H ik « r g n (0, <f>) 

we might suppose that dcoh n \g, n \ 2 /Jc 0 was the partial cross-section for 
-a rearrangement collision. Actually, however, this interpretation is 
only valid if the two electrons have opposite spins, and so are distin¬ 
guishable; if the spins are parallel a wave function antisymmetrical in 
the space coordinates must be used, of the form 

^(r^r,)— ¥(r a , r x ). 

It may be shown f that the cross-section for scattering is then given by 

Among the many applications of these formulae, we mention only 
some work on elastic collisions. Neglecting the term g representing 

t The formula was first given by J. R. Oppenheimer, Phys. Rev. 32, 361 (1928); see 
also Mott and Massey, op. cit., p. 113. An analogous formula for the simpler case 
of the collisions of one free particle with another is given in Chapter V of this book. 
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rearrangement collisions, it will be seen that the field T^ 0 (r) responsible 
for elastic scattering is just the electrostatic field due to the charged 
nucleus and a uniform negative charge distribution — e l0o( r ) I 2 - 

For hydrogen this may be evaluated to give 

F 0fl (r) - 

V a o! 

where a 0 = b 2 /me 2 = 0*54 x 10 -8 cm. For other atoms the best available 
approximation is to use the potential given by the method of the self- 
consistent field (Hartree field).*)* 

For atomic hydrogen an analytic expression for the elastic scattering 
may be obtained; this is 

j(B\ = 4(g +«5Jr*)* 

{) '4(i+a 2 K*r 

where K = |k x —k 0 | = 2fcsin^0. 

It will easily be seen that an improved approximation to the elastic 
scattering is obtained by finding the exact solution of 

vv+{f-^Fo(/-)}</' = 0 

instead of using Born’s approximation; this amounts to inserting 
F 0 ( r i)«/f 0 ( r 2 ) instead of e ?:/fo ~*i/r 0 (r 2 ) in the right-hand side of (17). This 
method is the one which has actually been most used in calculating 
electron scattering in gases. 

For references to the many papers on the subject up to 1938, see the 
second general reference at the beginning of this chapter. Among more 
recent work Massey{ and his co-workers have estimated a self-consistent 
field for methane by averaging the proton distribution over all direc¬ 
tions, and have used this field to calculate the scattering of slow 
electrons; the results are in quite good agreement with experiment. 

44.3. Relative intensities of elastically and inelastically scattered 
-particles 

The following considerations show that it is impossible to obtain 
inelastic scattering from an atom (or nucleus) unless there is some 
elastic scattering present too. We shall confine ourselves to the case 
where the dimensions of the nucleus are small compared with the wave¬ 
length, so that the scattering is spherically symmetrical. 

f Cf. Chapters V, VI; an analytic expression for the field of helium is given by E. A. 
Hylleraas, Zeits. f. Physik , 54, 347 (1929). 

J R. A. Buckingham, H, S. W. Massey, and S. R. Tibbs, Proc . Roy. Soc. A, 178, 119 
(1941). 
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A spherically symmetrical solution of the wave equation of the 
system must then have the asymptotic form 

Y(r,q) = r-\e~ ikr >/i 0 (q)+A e ikr ifi 0 (q )-\-Be ik ’ r ifi e {q)]. (19) 

Here <p 0 (q), ip e (q) represent the wave functions of the atom (nucleus) 
in its normal and excited states, k, k' the wave numbers of the elastically 
and inelastically scattered particles. The wave function representing 
an incident plane wave and scattered wave is 

where a, chosen to make terms in e~ ikr vanish, must be —1/2jfc. Thus 
the elastically scattered intensity is 


4 jfe* 


(1+A) 2 


and the inelastically scattered intensity 

/c = 4 ¥ 1 ^’ 

Since, however, the conservation of charge applied to equation (19) 


gives 


1 - \A\^^\B\\ 


we see that /„ cannot vanish unless A — — 1 and I e vanishes too; and 
if I f has the largest possible value (A — 0— 1/4 k 2 ), then 

h = 1 / 4 ^ 2 . 

Such a formula would, for instance, apply to the scattering of slow 
neutrons by a nucleus of very strong absorbing power. 
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45. Introduction 

In this section we shall show how to solve problems of the following 
type. An atom, or molecule, or any other system, is initially known 
to be in a given stationary state, denoted by the suffix zero, so the 
energy of the state is W 0 . It is then acted on by a variable external 
field for a certain time, say by the field of a passing a-particle or gas 
molecule, or by a light wave. At the end of that time it may have 
made a transition to any other state, with energy W n . We require to 
calculate the probability Po, that the transition has occurred. 

The physical principles by means of which the calculation is made 
are as follows: The wave equation for the unperturbed system (atom 
or molecule) is 

where // 0 is the Hamiltonian operator of the unperturbed system. The 
solutions of this equation are 

= VM *- 1 '™'S 

and the system is supposed initially, at time t = 0, to be in the state 
with wave-function In the presence of the perturbing field let the 
wave equation be && 

= (#o + W (i) 

The quantity U is the addition to the Hamiltonian due to the perturbing 
field, and will be a function of q , d/dq, and of the time. For instance, 
if we are dealing with the perturbation of a hydrogen atom by the field 
of a passing a-particle, U is the potential energy of an electron in the 
field of the a-particle. 

The equation (1), being of the first order in t , enables us to determine 
at any subsequent time t . Suppose at time t we expand as a 

SerieS = (2) 

n 

Then, by the usual interpretation of wave mechanics, the quantity P n 

8iv,n by P. - K«)l* 

is the probability that at time t the system is in the state n. 
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To obtain the coefficients a n (t), we substitute (2) into (1); we obtain 

T * (? ’ 0= 

n 

Multiplying both sides by \F*(g, t) and integrating over all q, this 
equation reduces to 

= j W*(q,l)UnqJ)dq. (3) 

To obtain an explicit solution we must in general assume that the 
perturbation is small, i.e. that during the perturbation the change in 
l F(r/, t) is not large. With this assumption we replace ¥*(<?, t) in the right- 
hand side of (3) by its original form Y 0 (g,£); we thus obtain 

= jw*(q,t)U%(q,t)dq. 

Writing J T*(g) U%(q) dq = (n | U |0), 

t 

we see that a n (t) = ~ J (n\JJ\0)e i{w o- w ^ fi dt. (4) 

o 

We note that in this expression (ra|i7|0) is in general a function of t . 
Certain generalizations can be made at once from the expression (4): 

(a) If the perturbing field varies slowly with time, so that many 
oscillations of the exponential function in (4) occur while U 
remains sensibly constant, the transition probability will be small. 

(b) If U is periodic in the time with period v (as, for instance, if U is 
the perturbing energy due to a light wave), then a n (t) will not 
increase with the time unless hv = W Q ~W n . We see therefore that 
for the excitation of atoms or molecules by light the Bohr fre¬ 
quency condition can be deduced from the theory, and that if 
V is independent of the time, transitions to states of equal energy 
only occur. 

45.1. Excitation of atoms by a passing a-particle 
In this section we shall show how to calculate the probability that 
an atom is excited by an a-partide which passes a long way from it. 
Actually the first attempt*)* to calculate the stopping power of matter 
for a-particles was made in this way; but the method used in Chapter 
IX, in which the motion of the a-particle as well as that of the electrons 
is treated through a wave function, is much more convenient. The 
t J. A. Gaunt, Proc . Camb . Phil . Soc . 23, 732 (1927). 



INTRODUCTION 


249 


§ 45 

calculation is included here as being the simplest and most instructive 
example of the calculation of transition probabilities, not for its practical 
value. 

The a-particle is supposed to move with constant velocity v past a 
hydrogen atom at the origin of coordinates, passing it at a distance p. 
If we take i = 0 to be the moment when the a-particle is nearest to 
the atom, then the perturbing energy U at any other time is, for an 
electron at the point (x, y, z) 

— Ze 2 

U = MX-W+(Y-y)*+{Z-zf )• 

Here Ze is the charge on the a-particle, and X , F, Z are the coordinates 
of its position at time t , given by 

X = vt, 
r = p, 
z = o. 

Expanding U for small values of x } y, z we have 

Since we have assumed that p is large compared with the radius of the 
atom, only the first terms in the expansion need be taken. 

The quantity a n (t) is thus given, after the a-particle has passed 
(t -> oo), by 

= Ze2 J 

— 00 

For the evaluation of these integrals, the reader is referred to the paper 
by Gaunt already quoted. 

45.2. Transitions to unquantized states 

A perturbing field (e.g. the field of a passing charged particle, or a 
light wave) may ionize an atom or molecule; in other words, transitions 
may be caused to states with unquantized energy. To calculate the 
appropriate transition probabilities, the simplest procedure will be to 
introduce fictitious quantization by enclosing the whole system in a 
large box, of side L , say. For instance, suppose the final state of the 
particle is represented by the wave function of a free particle 

^(r) = e* kr >. 

Then if we introduce our box of side L, together with the boundary 
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condition that shall be periodic with period L, the normalized wave 
function is 

1 


jb 


e *(k r ) (k = 27rn/Z; n = (n v n 2 ,n 3 )). 


If we write (n|l/|0) = ~ J e-^lT¥ 0 (r)dr 9 

then the transition probability to the state n is given by \a n (t)\ 2 with 
a n (t) given by (4). For values of k lying in the limits between k x and 
k x +dk v k 2 and k 2 -\-dk 2 , k 2 and k 2 -\-dk 2 , the number of states is 

L 3 dk x dk 2 dk s /( 2n) z . (5) 

Thus if (k | U' |0) = [ e- <(kr >(/T 0 (r) dr, 

the chance that the electron is ejected into a state with wave numbers 
in these limits is 

dk x dk 2 dk^ .. \ 


( 2 t rf 


>'(*) | 2 , 


t 

where «'(<) = 1 J (K \ U’ |0)e«"WW* dl 


( 6 ) 


45.3. Transitions due to a perturbation periodic, in the time 
This case is of great importance because of its application to the 
action of light waves on atoms and molecules. 

We suppose that the perturbing term which has to be added to the 
Hamiltonian of the system is of the form 

U — acos(27 rvt+c), 

where a } e are functions of p and q but do not contain the time. It is 
convenient to write this in the form 

(J = Fe 2nivlJ ^ F*e~ 2nivt 

where F is a complex function of p and q and F* is its complex con¬ 
jugate. Formula (4) then gives for the probability P n that at time t a 
transition has been made to the state n of energy W n 

Pn 

where 


2 

ih 


+ (n\F*\0) 


; KWI 2 . 



I dt+ 

t 

1 | expjtl 


J \ \ 

0 

I* j 


(7) 
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(w.|jP| 0) = J ?)%(?) d?, 

which is not a function of time. 

Unless \W n —W 0 \ = 2rrfiv 3 (8) 

the integrals in (7) do not increase with the time. The sudden im¬ 
position of the field will cause transitions, but after a time equal to 
1 Jv the probability that a transition has taken place does not grow any 
greater. Only values of the frequency for which (8) is satisfied need 
therefore be considered. 


If we write r 0w = {W Q — W n )/27rh 


and suppose that v lies in the neighbourhood of r 0 „, then only one of 
the two terms of (7) will be significant. Carrying out the integration 
with respect to t we find 


and hence 




(9) 


If v is equal to v 0 „, the probability P n (t) that a transition has occurred 
increases as the square of the time, instead of linearly with the time 
as one would expect. However, this apparent contradiction can be 
resolved as follows: 

If the final state is quantized, we shall have to consider perturbation 
by radiation with a band of frequencies extending over the natural 
width of the absorption line. This is done in §46 and it is shown that 
a transition probability proportional to t is obtained. In this section 
we suppose that the final state is unquantized—in other words, that 
the calculation refers to the ionization of an atom by a light wave or 
other periodic disturbance. The fictitious quantization of §45.2 may 
be introduced, the atom being enclosed in a ‘box’ of side L. Then 

|(tc|^|0)| 2 = -1| J e-^F%(r)dr\ 

and the number of states for which the particle is moving with its 
momentum in the solid angle dQ about the vector k, and with energy in 
the range dW 3 is by (5) 


&dQmih-*{2W)*dW = p{W)dW, say. 

To obtain the probability that, after time t , the particle is ejected 
in a direction lying in the solid angle dLX we have to sum over the 
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transition probabilities to all these states; this gives, making L tend to 
infinity so that the summation is replaced by an integral, 

dQ j P n (t) P (W n )dW n . 

Now the quantity P n (t) has a very strong maximum when W n passes 
through the value for which v—v Qn vanishes, and for large t practically 
all the value of the integral comes from here. Thus in all terms in 
p(W n )P n (t) except the term involving the time factor, we may set W n 
equal to this value, i.e. 

W n - W 0 +2nflv. 

The time factor gives 

f 2{1 —cos 2 tt (v—v 0ll )t} dw 


J 

or with 27r(v—v 0n ) = x , 


0„)} 2 


{2ir(v v 0 
j2(l-co S xt) dX ' 


When t is large the limits of integration may be replaced by ±00 and 
we obtain 


fit 


J 


2(1 —cost/) 


dy. 


The integral is equal to 2tt ; thus, finally, the probability that a transition 
will have taken place after time t is 

m\(n\Fm*p(W n ), (10) 

which increases linearly with t as it should. 

The probability per unit time that the electron is ejected with its 
momentum in the solid angle dfi is 


e-^ FYo (r) dr 


45.4. Transitions caused by a perturbing term independent of the time 

This is a special case of the problem considered in the last section, 

the frequency v being taken to be zero. 

Formula (9) remains valid. Since V is now independent of t, it may 

be written •, ofl _ 0 A 

Pn(t) = plM^IO)! 8 — .| - 0 - s2 ^" t} . 

W (2 7TV 0n ) 2 

Here U is the perturbing potential and the matrix element (n| t/|0) is 
formed with normalized wave functions. When t is large the function 
on the right has a strong maximum for v 0n equal to zero, i.e. for transi¬ 
tions in which energy is conserved. 
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If, as before, we introduce the function p(W)dW to denote the number 
of states with energy between W and dW (and if necessary subject also 
to other conditions, such as having the direction of momentum in a 
given solid angle), then the probability after time t that a transition 
has occurred is 

f P n (t) P (W)dW. 

As t tends to infinity, this gives as before 

t\(n\U\0)\* P (W)/K (11) 

for transitions in which energy is conserved. For other transitions it 
does not increase with t. 

The following is an example: A beam of electrons is incident on a 
centre of force in which the potential energy of an electron is V(r). 
If we normalize Tj,(r) to represent a beam such that one electron crosses 
unit area per unit time, then 

T* 0 (r) — 

Equation (10) then gives for the number of particles scattered into the 
solid angle dco per unit time 

™ f e« k >- k >- T V(r)dr “ dw, 

ZlTil* J 

which agrees with the formula found by another method in Chapter 
IX, equation (15). 

46. Emission and absorption of radiation 

The correct description of the intensities of spectral lines according 
to the quantum theory was first given by Einstein.f Corresponding 
to the transition between any two states n and m of an atomic system, 
he introduced three probability coefficients A nnv B nm , and B mn . The 
state n is supposed to be the state of highest energy, and the coefficient 
A nm is defined as follows: A nm dt is the probability that an atom, initially 
in the state w, will make a spontaneous transition in the time interval 
dt to the state m. This probability was supposed to be independent 
of the past history of the atom or of the process by which it had been 
brought to the state n. 

The coefficients B nm and B mn are defined as follows: suppose the 
atom is in the presence of radiation, unpolarized and incident equally 
in all directions, such that the energy density with frequency between 
v and v+dv is I(v)dv. Let v nm be the frequency corresponding to the 
transition between the states n, m. Then if the atom is in the state m 
f A. Einstein, Phys . Zeits. 18, 121 (1917). 
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(the lower state), the chance in a time interval dt that it will make a 
transition to the state n with the absorption of a quantum of radiation 
is B rnn I{v nm ) dt. Also, if it is in the upper state n , the chance that in 
the presence of the radiation it will make a transition to the ground 

8tatei8 {A nm +B nm l( Vnm )}dt. 

Einstein was able to prove the following relationships: 

B nm = B nm , A==^B. (12) 

c 

The proof is as follows: Suppose a number of the atoms concerned are 
in thermal equilibrium in an enclosed space at temperature T , together 
with the (black body) radiation in equilibrium at that temperature. 
Then as many atoms must make the transition upwards as make the 
transition downwards. Hence 


nm +B 

nm ^mn v nm )N m , 

where N. n , N m are the numbers of atoms in the tw r o states. Now in 
thermal equilibrium, by Boltzmann’s laws, 


N m 


e-W-WMM', 


which by the Bohr frequency condition gives 


N. 


Thus 


N, 

I\ v nrn) ^ 


21 — e -hVnJkT' 


A „ 


B e hv ™l kT —B * 

XJ mn ° "jim 

Comparing this with the well-known formula for the intensity of black 

8t rhv* dv 1 / 


body radiation, 


I(v) = 


^.3 - ghv/kT _]_’ 


we see that equations (12) are satisfied. 

In attempting a quantal theory of the emission and absorption of 
radiation we are faced with the following position: The theory of 
absorption and stimulated emission is relatively elementary; the light 
wave can be treated as an electromagnetic field periodic in the time, 
and the probability that the atom will make a transition from one 
state to another under its influence can be calculated by the methods 
of the last section. As we have seen, the Bohr frequency condition can 
be deduced from the theory, and the values of the Einstein B coefficients 
obtained. The A coefficient can then be deduced from equation (12). 
But this elementary theory does not by itself predict spontaneous 
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emission; in the absence of a field one cannot see why transitions should 
occur at all. To account for spontaneous transitions one must intro¬ 
duce a more complete theory in which atom and radiation are both 
considered as one system, and the radiation itself is quantized. 

The elementary theory will be given in the next section; the more 
complete theory in § 47 and the following sections. 

i 46.1. Elementary theory of absorption 

As explained in the last section, in this section we shall treat the 
action of a light wave on an atom as that of a perturbing electro¬ 
magnetic field. Such a theory will account for absorption, but not for 
emission,. 

A plane polarized light wave moving along the z-axis can be written 
in the form 

H v = = 6sin2 7 rv|?-<|, (]3) 

H X — Ey — 0 . 

It is convenient to describe this by a vector potential A and scalar 
potential <£, from which E and H can be deduced from the equations 

E= -J'¥- gra<1 *' 


H = curl A. 


If we set 


A x — a cos 

Ay — A z = 


27rv |-—tj (a = 6c/27rr), 


0 , 


it is easily verified that (13) is satisfied. The energy density 1 in the 
wave is given by 


/ - 


E 2 +H 2 _ t raV 
877 ~~ 2 c 2 


By equation (46) of Chapter I, the perturbing term introduced by 
a vector potential A for which div A = 0 is 

--- (A grad). 
imc v 6 

In our case this gives 


eha 
2 imc 


e *iriv(ztc-i) A ^-complex con jugate J. 


Thus the probability that after time t the atom has made a transition 
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from a state with wave-function tfi 0 to a state with wave-function ifi n 
is given by (9) with 

IW-WI 2 = 


where 


Qn 


/t; 


;)*«.> 


flirty z!c _ o 

dx 


Substituting in terms of the energy density we find for the probability 

P ” (<) P It) - —IMO 2 { i - cos 2 w ( t '- y o») < } 

n{ ) ' { 2 n(v-v 0n )f * 

As in the previous section, if we set v equal to v Qn we find that P n (t) 

increases as the square of the time. We have therefore to consider that 

the atom is irradiated by a band of frequencies, with energy density 

J(v) dv in the range dv. Since 


ou 

/ 


2{1 —cos27T(t>—v 0 „)/,} 
{‘2n(v-v 0n )t} 2 


dv — t. 


(14) 


:/v; 


a'F 

e 2nivzlc —dr 
dx 


we see that the probability per unit time that the atom makes a 
transition from the state 0 to the state n is 

f T( \ ^ 

2tt mv t 

It will be noted that the z-axis is the direction of propagation of the 
wave, the #-axis the direction of polarization. 

The Einstein ^-coefficient will be obtained by averaging over all 
directions of incidence and polarization. 


The integral 


J t •* 


Y*e 2 Ttivzlc°_±S! dr 
dx 


(15) 


thus determines the absorption and emission intensities. This quantity 
can be simplified, making use of the fact that for light absorbed in the 
line spectrum of an atom, the wave-length c/v of the light is large com¬ 
pared with the diameter of the atom. We may thus expand 

flirivzjc 1 | | 

~r c 

To a first order of approximation, therefore, the integral (15) reduces to 


/ 


n 8x 


By equation (21) of Chapter VII this may be written 

2t Tin 

IT 


-*wj V*x%dr. 
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Thus the probability of absorption takes the form 

Averaging over all directions of polarization we obtain for the 
^-coefficient « 2 

£ = |^(KJ 2 +|yoJ 2 +KJ 2 ), 

and by (11) for the A-coefficient 

A = ~ (^OnP+lyOnP+l^oJ 2 )- 

It is convenient to express these quantities in terms of oscillator 
strengths f n (Chap. VII, p. 169); we obtain 

B = A = 87rW/ w /3mc 2 . (16) 

This form is convenient because f n is dimensionless, and because 

2fn=h 

where the summation is over all transitions which start from a given 
state. 

46.2. Selection rules 

Suppose that the state 0 has orbital quantum numbers Z, u; in other 
words, that the part of the wave function involving 0 , cj> is 

JFf( cos0)e iu *. 

Suppose also that the state n has quantum numbers Z', u\ Then the 
matrix elements x 0n , y 0n , z 0n will contain integrals of the type 

COS0 

Pjf (cos 0)e~ iu ' < l > sin 0 cos <f> Pf (cos 0)e iu i sin 0 d0d<f). 
sin 0 sin <f> 

It is obvious that this integral vanishes unless 

u—u' = ±1 or 0. (17) 

The integral also vanishes unless 

Z—Z' = ±1. (18) 

This is shown in the appendix to this book; it is easily verified in the 
simple case where Z = 0. 

Equations (15) and (16) give the well-known selection rules for optical 
transition. If they are not satisfied the ‘dipole moments’ etc., 
vanish, and to obtain the transition probability we must then take the 
second term in the expansion (14); the integral (15) becomes 


jj 


2vv 

c 


j* 


,8% 

* dx 


dr . 


5072 


8 
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Transitions for which this quantity does not vanish are said to possess 
a ‘quadripole’ moment. It will be seen that for transitions with a 
quadripole moment the transition probabilities are smaller by a factor 
of order (radius of atoms/wave-length of light ) 2 than for those with a 
dipole moment. 

It is worth remarking that there is one type of transition for which 
the transition probability vanishes exactly; this is for transitions from 
one 5-state to another, so that both % and are spherically sym¬ 
metrical. 

, *' / 

' 46.3. Alternative treatment of emission 

For certain purposes, especially some of the applications of rela¬ 
tivistic quantum theory in Chapter XI, it is very useful to have an 
expression for the electromagnetic field radiated by an atom. The 
treatment given in this section is semi-classical, and can only be 
justified on the basis of quantum electrodynamics; it is none the less 
very useful. 

Suppose that in an atom there exists a charge density given by 


and a current j 


p = p 0 (x,y,z)e- 2 ” ivi +c.c. 
j = } 0 (z>y,z)e- 2 ™ ( +c.c. 


(19) 


Then the electromagnetic field radiated by this charge and current are 
given by the equations 


V 724 * d * A o 

VA ° c*~W 


— **P9 


V 2 A— 


i a 2 A 


= --j. 

8t 2 c J 


Here A 0 , A are the scalar and vector potentials. If we consider retarded 
potentials only, obtaining thus a solution representing an outgoing 
wave, the solution is 

p p2mv\r'-r\Jc 

A$ — e- inivl J —--p- p 0 (x',y',z’)dT'+c.c. 
with a similar expression for A. For large r this becomes 

p2iriv(rlc-0 /* 

A 0 = —-- e^ iv ^ c p 0 {x f ,y\z f )dr f . (20) 

Here n denotes the vector r/r. 

Suppose now that an atom is in an excited state 0 , and that we wish 
to know the field that will be radiated into transitions to lower states. 
We shall make the following new assumption: that corresponding to 
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any lower state n, the field radiated initially is that which would be 
given classically by a charge density p 0n and current density j 0n given 
by (19) with 

Po = eY n W*, 

Jo = g(Y n gradT*-Y*gradYJ, 
hv=W 0 -W n . 

For the corresponding expressions using Dirac’s relativistic equation 
cf. p. 309. 

It may easily be verified, using Schrodinger’s equations, that the 
continuity equation is satisfied: 

The field radiated is then given by (20). 

It may be verified, by obtaining the energy radiated per unit time 
and dividing by hv, that equation (16) for the A-coefficient is obtained. 

47. Proper treatment of the radiation problem : quantization of 
the field 

We have seen that an elementary theory, in which the electro¬ 
magnetic field is treated as a perturbation acting on an atom, can 
account for the absorption but not for spontaneous emission. To 
account for this, it is necessary to quantize the electromagnetic field 
and to treat the whole assembly, atom and radiation, as one dynamical 
system. 

The theory, due originally to Dirac,f is based on the following idea: 
The radiation is thought of as shut up in a box; there will thus be a 
series of normal modes each with its discrete frequency v . We know 
from the experimental evidence reviewed in Chapter I that the energy 
of light of frequency v can change only by multiples of hv; and thus it 
involves no great step forward to treat each normal mode as a vibrating 
system, with wave functions appropriate to a simple harmonic oscillator. 
A transition in which an atom emits or absorbs radiation is thus treated 
as a transition of the whole system in which the appropriate normal 
mode of the radiation changes its quantum number by unity. The 
perturbing energy (not a function of the time) is the interaction between 
the atom and the radiation. 

We have first to set up the Hamiltonian of the radiation field in the 
t P. A. M. Dirac, Proc . Bay. Soc . A, 114, 243 (1927). 
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absence of any charged particle. This field is conveniently described 
by a vector potential A chosen so that the scalar potential vanishes 
everywhere. The electric and magnetic vectors are then given by 
E = —-A/c, H = curl A. 


A typical standing wave, representing radiation shut up in a box, may 
be written A — /Jesin(xr)cos/cci. (21) 


Here e is a unit vector, /3 an arbitrary constant, and l/#c any length. 
In a volume V the number of normal modes with given polarization, 
frequencies between v and v+dv, and direction x of propagation in the 
solid angle d£l isf 

V Pv dvdQ = Vv 2 dQdv/c 3 . (22) 

The function p v = v 2 /c 3 will be called the density function of the 
radiation. 

We now write instead of (21) 

A = ^(87rc 2 )esin(xr )<?(£). (23) 


The quantity q(t) is the dynamical variable which specifies the state 
of the vibrating normal mode. We shall treat it exactly as the dynamical 
variable of an oscillator according to quantum mechanics. For this we 
need the Hamiltonian and hence the wave equation of the radiation 
field. The purpose of the normalizing factor ^(Snc 2 ) will appear below. 

The electric and magnetic vectors E and H derived from the vector 
potential A (23) are 

E = — <J( 87 r)eam(xr)q(t) 

H — % /(87rc 2 )(xxe)cos(xr)g(0 



Now since x and e are perpendicular vectors and e is a unit vector 
wehave (xxe) 2 = k 2 . 


Making use of this result and of the fact that the mean values of 
sin 2 (xr) and cos 2 (xr) are •£, we find that the energy of this wave is 

W = f (E 2 +H 2 ) dr — i(g 2 +c 2 /fV). (25) 


The momentum conjugate to the coordinate q in the Hamiltonian 
formalism is 

P 


8W . 

-j-7- = s; 
8 q 


so that, putting 


. 2„2 


CrK 


477 V, 


( 20 ) 


f Cf. J. H. Jeans, Dynamical Theory of Gases, 4th ed., chap, xvi (Cambridge, 1926). 



$ 47 PROPER TREATMENT OF THE RADIATION PROBLEM 261 

we have for the Hamiltonian of this vibration 

W = \{p 2 -\-^n 2 v 2 q 2 ). (27) 

It follows that, with the normalizing factor as chosen, the vibrating 
normal mode behaves like a simple harmonic oscillator of mass unity. 
This is true either in classical or wave mechanics. 

In wave mechanics we may set up the usual Schrodinger equation 

^+&{ W ~ 4’A’W = 0- (28) 

The possible values of the energy will be 

W n = (n+\)Kv, (29) 

where n is an integer, and the wave functions are the Hermitian 
polynomials discussed in Chapter II and the Appendix. 

The pure radiation field may then be described by the superposition 
of fundamental vibrations of the type (23) with a range of frequencies 

»i, v 2 , v 8 . 

The number of frequencies lying in any given range is determined by 
the formula (22), but the vectors e 8 , x s are considered to be distributed 
at random. The Hamiltonian of the radiation field is obtained by 
taking the sum of all the contributions W s of the type (27); thus it is| 

w = 2 w s = 2 ( 3 °) 

8 8 

In the quantum theory of radiation it is often more convenient to 
replace the cosine and sine waves representing the radiation field by 
their complex exponential forms, which represent waves moving in 
definite directions instead of standing waves. In this representation we 
write for the vector potential 

a = 2 A, = 2 {a,fc(<)+a* ?*(<)}, (31) 

8 8 

where the field coordinate q 8 (t) is now complex and 

a s = *J(±nc 2 )e a exp i(x a r). (32) 

The Hamiltonian of the radiation field can be calculated as in the case 
of real wave functions; it comes out to be 

w = 2 (4At+^lq a qt) = 2 q a q*). (33) 

8 8 

f It can easily be shown that the sum on the right-hand side of (30) representing the 
sum of the energies of all the oscillators is equal to the energy of the radiation field. It 
should also be observed that only the radiation field can be represented by a super¬ 
position of stationary waves in this fashion; such a representation is not possible for 
radiation in the presenoe of an external electromagnetic field. 
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Now since q 8 is a solution of the harmonic equation 

q a +^t\lq 8 = 0 , 

it follows that q s q* — q s q *; 

we may thus write the Hamiltonian (33) in the form 

W=S^lvU e qt- 

8 

This may be reduced to the form for the real variable 

w = \2 

8 

by the substitutions 



47.1. Hamiltonian of the complete system 

We have now to determine the Hamiltonian of the atom and the 
radiation field considered as a single mechanical system. The non- 
relativistic motion of an electron in an electromagnetic field described 
by a vector potential A and a scalar potential <j> is uniquely determined 
by the Hamiltonian 

where P denotes the momentum of the electron, m its mass. Writing 
V for 0/c, expanding the scalar product (P—eA/c) 2 , and retaining only 
terms of order 1/c, we have 


§> = ^/c+(P-eA/c) 2 /2m, 


£ = P 2 /2m+eF-e(P.A)/mc. (34) 


The Hamiltonian of the complete system comprising the atom and the 
radiation field is obtained by adding to (34) the Hamiltonian of the 
radiation field, equation (30) or (33) above, and by substituting for 
the vector potential A the corresponding expression (23) or (31). Substi¬ 
tuting from equations (31) and (33) into equation (34) we obtain for 
the Hamiltonian of the coupled system 


$ = So+$i> 

where £o = jr-P 2 +eF+ 2 

zm s 


(35) 

(36) 


is the sum of the Hamiltonians of the pure radiation field and of a 
particle moving in a field free from radiation, and 

Si = y {?.e^+<tfe^}(e..P) (37) 

m 

8 

is the energy of interaction between the electron and the radiation 
field. 
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The transition probability for a transition in which the electron 
jumps from one state to another, emitting or absorbing a quantum, 
can now be written down at once from formula (11). Suppose that the 
direction of propagation of the light wave lies in a solid angle dil about 
the direction x. Let this wave have N quanta before the process, N' 
after, and let the suffix 0 denote the initial state of the atom and the 
suffix n the final state. Then, substituting for U in the formula (II), 
the probability per unit time that the transition occurs is 

(38) 

where and are solutions of the unperturbed wave equation 

(39) 

and in this case 

{?e« Kr >+ q*e-« Kt >}(e . P). (40) 

m 

47.2. Calculation of the matrix elements 

In the last section we showed that, in order to obtain the transition 
probability of a process involving the emission or absorption of radia¬ 
tion, we have to consider the solutions of the wave equation (39) of the 
unperturbed system. Suppose that Y n is an eigenfunction and W n the 
corresponding energy value of the Schrodinger equation 

and that <f> N is an eigenfunction and W N a corresponding energy value 
of the wave equation 

(Y+^vy Hs = w*+s- 

Then the function T’v.n = % L <j> N (41) 

is an eigenfunction of the Schrodinger equation (39) corresponding to 
the eigenvalue ^ = W N +W n . (42) 

Substituting from (40) and (41) into the equation defining the matrix 
element (JV'',7fc|$ 1 |iV’,0), we obtain the equation 

(N\ nfoltf.O) 

= -^{(n|P,|0) J foqM+WPm j <f>U*<f> N dq), (43) 
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where the matrix elements (tt|i*|0) and (?i|P*|0) are defined by the 
formulae 

(n\P 8 \ 0)== [ne^(e.P)T 0 dr 

r • (44) 

(»|PJ|0) = f T*e-^>(e.P)%dr) 

Since the functions <f> N are the eigenfunctions of the wave equation 
for a linear oscillator, it follows that the integral 

/ tir&sdq 

vanishes unless \N— JV'I = 1 and also that 


J <!>* 


The energy corresponding to the eigenfunction </) N is 

W = hv(N+$). 


Substituting from equation (45) into equation (43) we see that the 
matrix element (N\ 0) is different from zero only if N' = iV± 1. 

We thus obtain for the two non-vanishing matrix elements 

{N+l,n\^\N,0) = - i J V*n^PeXdr 
(N-l,n\^\N, 0) = J Y* e-^ Pe %dr J 

where p e = (Pe) 


denotes the component of the vector P in the direction of the polariza¬ 
tion. 

For a process in which a quantum is emitted, N is zero, and for the 
absorption of a quantum N = 1. Thus the transition probability for 
a transition in which an electron jumps from one state (0) to another 
(n) with the emission of a quantum of frequency v is 

SRslJ (48) 

and the transition probability for a similar process in which a quantum 
is absorbed is 

sn|/'•>-">.’•'.4 <«> 

If the final state of the electron belongs to the continuous spectrum 
p w dW — mcpdWd£llh*c z , 
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where p is the momentum of the electron. In that case (38) becomes 

i££(f <«.) 

48. First-order radiation processes 
48.1. The emission of radiation 

As a first example of the application of the general theory developed 
in the preceding sections we consider the emission of light by a hydrogen 
atom. Suppose that initially the electron is in an excited state in which 
its wave function is Y n , and that it falls to the ground state with the 
emission of a quantum of radiation of frequency v. The wave function 
of the electron in the ground state is T 0 , say. Then the roles of the 
suffixes n and 0 are interchanged in equation (48). It follows from 
equation (8) that the transition probability is negligible unless the 
frequency of the emitted radiation is 

v ■■= (W n -W 0 )/h, 

which is the value given by Bohr’s frequency relation. 

The square of the matrix element occurring in the expression (48) 
can be reduced easily to a familiar form. In general the wave-length 
of the radiation emitted is large in comparison with the dimensions of 
the atom so that the function expi(xr) (k = 2ttvIc) will be practically 
constant over that part of the space in which the wave functions of 
the electron differ appreciably from zero. We may therefore write 

jY* 0 e^p e T n dr = J Y*p e Y n dr. 

If, now, we write P = mv 

and denote by 0 the angle between the velocity vector v and the 
direction of polarization, we obtain 

J x Y* Pe Y n dr = m cos© J TJvT.dr, 
so that || x ¥*p e Y n dr| 2 = m 2 cos 2 0 | J Y* vY w dr | 2 . 

Now | Y* v x Y„ dr = 2 mv J Y* xY n dr = 2mv(0\x | n) 

and similarly for the y- and ^-components of the velocity v. Thus 

| / Y* *> e Y„<Zr| 2 = 47rWcos a 0|(O|r|n)p (50) 

where |(0|r|»)| 2 denotes the sum 

(0|a;|re) 2 +(0]y|w) 2 +(0|z|ra) 2 . 


( 51 ) 
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Equation (48) then gives for the chance per unit time that a quantum 
is emitted into the solid angle dCl 

—oos 2 0|(O|r|w)| 2 <2£2. (52) 

tVC 


Multiplying this expression by hv and integrating over all angles we 
obtain an expression for the total intensity of the radiation emitted 
by the atom in unit time. The summation over the directions of 
polarization replaces cos 2 © by sin 2 #, where 8 is the angle between the 
direction of propagation x and the position vector r of the electron; 
making this substitution in (52) and integrating over all possible values 
of the angles, we find that the total intensity radiated per unit time is 


S = 


647r 4 e 2 v 4 
3c 8 " 


l(0|r|n)| 2 . 


(53) 


This is the same formula as that given on p. 257. 

In the classical theory, if we take as a model of the light source a 
single electron bound elastically to a massive charge and moving with 
simple harmonic motion of frequency v in a straight line with amplitude 
r, we find that the energy emitted per unit time is 


32*W ?2 

3c 3 


(54) 


We see therefore that the radiation is the same as that which would be 
given out by a classical dipole of amplitude r given by 

r 2 = 2|(0|r|ft)| 2 . 


48.2. The photoelectric effect 

We saw in the last section that when light is emitted by an atom 
its frequency v is given by the equation 

V = (W m — W n )/h, (55) 

where W m , W n are two energy levels of the atom. In a similar way it 
can be shown that light of frequency v can be absorbed by an atom 
only if v satisfies a relation of the type (55). Another process is possible, 
however, for light of such a frequency that hv exceeds the ionization 
potential, /. It may then happen that the electron absorbs a quantum 
of radiation of sufficient energy to cause it to leave the atom with 
kinetic energy hv—I. We can easily calculate the transition probability 
for a process of this type, if we assume that the kinetic energy of the 
ejected electron is very much greater than the ionization potential I 
and if we employ non-relativistio wave mechanics throughout. This 
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last assumption is equivalent to assuming that the kinetic energy of 
the electron in the continuous spectrum is small in comparison with 
the rest-energy me 2 of the electron. 

We shall carry out the calculation for an electron in a (Is) state and 
thus for the K -electron of an atom. 

In the integral occurring in the expression (49) for the transition 
probability, % is thus the wave function 

Y 0 = (tt a?)~*e~ r l a (a = a 0 /Z) 

of the electron in the if-shell, and, if the energy of the ejected electron 
is very much greater than the ionization potential, is the wave 
function corresponding to an electron in the continuous spectrum with 
momentum p, i.e. the plane wave 

Y n = e i(pr) l hc . 

If we write q = p—x/fic, 

then q is the momentum transferred to the atom and we have 

J '¥Zli c e-* Kr) %dT = p e (ira 3 )~i j e -^+^ T ^dr, 


where we have written a = he/a. Using the result 


we then obtain 


J 


e -yr+f(pr) ^ — 8 JIZ- 

(/}*+/)*’ 


f e-^ KT W a dr = v l ——^ 87rot ^ c ) 3 /gg) 

J t, A e ? 0 ar Pe^sJ (a2+?2)2 - W 

Substituting this value of the integral into the expression (49 a) for the 
transition probability and dividing by the intensity of the primary 
beam per cm. 2 (i.e. by c for a single quantum), we obtain the quantity 


32e 2 2 a 5 


dCl, 


(57) 


which has the dimensions of an area and is called the differential cross- 
section, or the effective area that has to be hit by a quantum to eject 
an electron in the solid angle dQ. 

To obtain the total cross-section for the ejection of photo-electrons 
in any direction we integrate (57) over all angles. Since the if-shell 
contains two electrons we multiply by 2 to obtain the cross-section </> K 
for the photo-effect of the if-shell; the result of the integration isf 



f W. Heitler, loc. cit., pp. 122-3. 
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where 



(59) 


If hv is nearly equal to I the exact wave functions of the continuous 
spectrum are used for in the evaluation of the matrix element. The 
formula (58) is then replaced byf 

exp(— Apcot^p) 137 3 //\ 4 


<f> K = 128tt- 


(60) 


where p is defined by 


p 2 


i 

hv—I’ 


i.e. p 2 is the ratio of the ionization energy to the kinetic energy of the 
ejected photo-electron. 


48.3. The breadth of spectral lines 

The theory developed in § 48.1 accounts for the emission of radiation 
from an atom and is based on the supposition that energy is conserved 
in the process. Such an assumption leads to a formula for the intensity 
of a sharp spectral line emitted by an atom. We know from spectro¬ 
scopic observations, however, that the lines of an atomic spectrum are 
not sharp but possess a certain characteristic breadth. That this natural 
line breadth does not follow from the analysis of §48.1 is due to the 
fact that in solving the equation 

= m (6i) 

where $ is given by equations (35)—(37), we employed an approximate 
method whose use is only justifiable for small values of the time t . 
An improved method of solution of equation (61) valid for higher 
values of t has been developed by Weisskopf and Wigner J and has been 
found to lead to a simple formula for the breadth of a spectral line. 

We shall consider only the case in which the atom has two states, 
an excited state described by the wave-function Y n and possessing 
energy W n and a ground state of energy W Q whose wave function is T 0 . 
The extension of the analysis to atoms with several states presents no 
essentially new features. We assume that initially the atom is in the 
excited state in the absence of radiation and that it falls to the ground 
state with the emission of a quantum of radiation hv 8 whose frequency 
v 8 is approximately equal to the fundamental frequency 

v 0 = (W n -W 0 )lh, (62) 

given uniquely by the energy difference between the two states, 
t M. Stobbe, Ann . der Physik t 7, 661 (1930). 

t V. Weisskopf and E. Wigner, Zeits. f. Physik , 63, 54 (1930); 65, 18 (1930). 
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The solution of equation (61) can readily be obtained if the solutions 
and the eigenvalues W n of the Schrodinger equation (39) are known. 
If we write T = £ b k (t)%e iW *W (63) 

and substitute this wave function in equation (61), we obtain the 
equation | %+iH J b k {t)e w ^Y k = 0. 

Multiplying both sides of this equation by the conjugate wave function 
and integrating over the space of all the coordinates—those describ¬ 
ing the radiation field as well as those defining the position of the 
electron—we obtain the system of differential equations 

*»(0 = (n\^\k)e^Ml f %(t). (64) 

0 

It follows from equation (63) that the quantity |6 n (£)| 2 is the probability 
of finding the system in a state of energy W n at time t. 

Denoting by {n, 0) the state in which no radiation is present and the 
electron is in state and by (0, l 8 ) the state in which the electron is 
the ground state and a quantum hv 8 is the only radiation present, 
equations (64) reduce to 


KiP) = |(°. 0)6 nO (0e^«-^ J 

Initially, at time t — 0, 

^w,o(0) = 6o,i,(0) = 0* 

If we now make the substitution 

KS) = ( 

in equations (66), we obtain immediately by a simple integration 

i A2iri{v 0 ~v t )-y]t — l 

WO = J(°. 1. fill*. °) V77-( 

to 2 t Ti(v 0 —v 8 )—y 

As t oo the exponential factor decreases rapidly to zero so that 


bo t iM )' 


i (0, lalSil^Q) ^ 


The probability that, after a time t, long in comparison with the life- 
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time l/2y of the excited state of the atom, a quantum of radiation hv 8 
has been emitted is therefore 

(O.IJ&KO)* 

& 2 {4t r 2 (v 0 —v a ) 2 +y 2 }‘ 

Multiplying by hv 0 and the density function p v , integrating over all 
directions of propagation and summing over both directions of polariza¬ 
tion we obtain for the intensity distribution 


I(v) dv = 


hv 0 dv 


j Py(n, 01 ^ 10 , 1 s )*dn. 


^{47r 2 (v„-v) 2 + /} 

Now the total probability of a spontaneous transition per unit time is 


w 


so that 


= p w (n,0\$>x\WdQ, 

/(v) 


( 68 ) 

(69) 


277 477 2 (v--^o) 2 + y 2 
where we have written I 0 for hv 0 . 

We have yet to determine the constant y. Substituting from equa¬ 
tion (67 a) into the first equation of the set (65), we obtain the relation 

v== IV [ 1 __ e [2ni(v s ~v 0 )+y}n 

y &Z<2nHv 0 -v 8 )-y L J * 

We now replace the summation with respect to s by an integral over 
v 8 , first multiplying the integrand by p w dwdil. We then have to 
evaluate an integral of the type 

J 2tt i(vv—v 8 )—y 

which reduces to $f(v 0 ). It then follows immediately that 

y = iw, 

where w is given by equation (68). The formula (69) then reduces to 

w I 0 


I(v) 


(70) 


2 n 4 tt 2 (v —v„) z +M> a /4* 

The form of the distribution function I(y) is shown in Fig. 63. I(v) has 
its maximum at v — v 0 when its value is 

I(v 0 ) — 2I 0 /ttw. 

It should also be observed that 

4„±£)=i/w. 

The formula (70) is identical with that deduced in the classical theory 
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for exponentially damped harmonic oscillators; in the classical result 
the formula ( 68 ) for w is replaced by 

w = 87 T 2 eh%/3mc z . 

Although the intensity of the line is measured by the total area under 
the curve, a reasonably accurate estimate of the intensity is given by 




the height of the peak. In actual practice it usually happens that there 
are several components of different intensities very close to each other. 
If the half-widths of the lines are comparable with the separation we 
get a distorted curve, in which it is difficult to distinguish the individual 
components; for example, Fig. 64 shows the intensity curve of the 
.ff a -line as measured by Hansen.| In this instance the experimental 
curve is composed of three component curves of which the central is 
almost completely overlapped by the other two. 

We have only considered the broadening of spectral lines caused by 

t G. Hansen, Ann, der Phyaik, 78, 558 (1925). 
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the damping due to the emission of the radiation itself. In addition 
to this there are several other causes contributing to the broadening 
of a line: 

(а) The Doppler effect due to the fact that the radiating atoms are 
not at rest.f 

(б) The effect of collisions of excited atoms with neighbouring 

atoms.J 

(c) Interactions of excited atoms with neighbouring atoms leading 
to a shift or splitting of the excited states.§ 

We shall not consider these effects here; for accounts of them the 
reader is referred to the papers cited. 

49. Radiation processes of higher order 

In the radiation processes considered above it has been possible to 
obtain the transition probabilities by calculating the matrix element 
for the transition from an initial state m to a final state n. 
Sj denotes the interaction between the electron and the radiation field, 
and are eigenfunctions of the wave equation (39). In these 

problems it so happened that these matrix elements did not all vanish. 
In many problems the matrix elements vanish for direct transitions 
from the initial to the final state. When this is the case it is necessary 
to improve the method of approximation. As in the case of second- 
order perturbation theory (Chap. Ill above), we have to repeat the 
calculations, retaining terms of a higher order. The inclusion of these 
higher order terms is equivalent to assuming the existence of ‘inter¬ 
mediate* states n' such that (n\^ x \n') and (n'\% x \m) do not vanish 
although (n\$ x \m) does. 

Thus in the case of scattering by an atom when a quantum k 0 is 
absorbed and a secondary quantum k is emitted we have two such 
intermediate states. In the first k 0 has been absorbed and k is not yet 
emitted; in the second k has been emitted and k 0 is not yet absorbed. 

In the general case, let us denote the initial state by n Q , the final 
state by n, and the intermediate states by n'. Then assuming a solution 
of the wave equation (61), in a series of the form (63), we have initially 

bn o(0) - 1, 

and all the other 6*s zero. 

t M. N. Saha and B. N. Srivastava, A Treatise on Heat , 2nd ed., Appendix 4 
(Allahabad, 1935). 

J J. H. Van Vleck and V. F. Weisskopf, Rev . Mod . Rhys. 17, 227 (1945); H. Frohlich, 
Nature (1946). 

§ V. Weisskopf, Phys. Zeits . 34, 1 (1933); Observatory , No. 713 (1933). 
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Thus, as a first approximation, we may take 6 no to be unity on the 
right-hand side of equation (64), and all the b m (t) zero except the b n >. 
In this way we obtain the system of differential equations: 


MO = l(n'\§ > 1 \n 0 )e^.-^W t 

MO = \ J (n\^\n')b n .(t)e^~^l\ 


Integrating equation (71) we obtain 

MO = 


e W*-W,W— 1 
W —W. * 

T 'tin Tr r). 


(71) 

(72) 

(73) 


the condition b n >( 0) — 0 being satisfied. Substituting from (73) into 
(72) we derive the differential equation 

bjt )=- y 

ft W n<l —W n . 

for the determination of b n . Integrating this equation we obtain 
M Y (»|^ 1 l» , )(» , |jQ 1 |« 0 ) [ g <(ir %- |, » l *-l eW-^ fi -l 


Z, 

n' 


Wn.~W« 


\ w„ 


w 

no TT n 


W n 


-WJtlh_ i 

T~W 

. rr n 


which may be written in the form 


M0 ~^ w n ~w n + 2 (w n -w n .) { w n -wj e * 1} ’ 


with 


■E- _ y M5il m 'X w 'l5il ra o) 
^ >1 Z, W—W. 

ft? rr n rr n' 


(74) 


If we form \b n (t)\ 2 and neglect terms containing W n —W n > since they play 
no role in this case, we obtain 


I MO I* = 


2|45xl 


(w n -wj 


1—'COS 


(Wno-~W n )t) 

h 


(75) 


If, as before, we assume that in the neighbourhood of the final state 
there is a large number of states with energy between W and PT+dETT, 
then by means of equation (75) we obtain for the transition probability 
per unit time 9 

w = T pw ^^' ( 76 ) 


For radiation processes for which both the matrix element (w 0 ISil w ) 
and the sum T ^') vanish, it is necessary to 

assume that two successive intermediate states n' and n" exist for 

5078 x 
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which (n , \^ 1 \n ,, ) i and (w*|$ 1 | n) are different from zero. The 

transition probability is again given by equation (76) except that now 
f>i is defined by the equation 

(nolgi|nQ(n1giK)(n1giln) ( ) 

-W n .)(Wn~W n .) * 1 ' 

The transition probability per unit time for the simple types of radia¬ 
tion problem considered in § 48 is, of course, also given by equation (76), 

but nOW & = (*o|$il»). (78) 

Thus we may classify radiation processes according to the type of 
formula used for the calculation of the quantity H x occurring in the 
formula (76). The radiation process is said to be of the first order if H x 
is given by equation (78), of the second order if it is given by equation 
(74), of the third order if it is given by (77), and so on for higher orders. 
Furthermore, it is evident from equation (78) that and consequently 
the matrix element (n^Sfr^n), °f the first order in e , so that for a 
radiation process of the first order, the transition probability per unit 
time, w , is of order e 2 . Similarly for radiation processes of the second 
order w is of order e 4 , and for those of the third order w is of order e 6 . 

Examples of radiation processes of the first order have already been 
considered above (§48). As an example of a radiation process of the 
second order we may cite the Compton effect. In this case the initial 
state consists of a quantum with momentum k 0 (|k 0 | = hvjc) and a free 
electron at rest with energy W 0 = me 2 ; in the final state we have a 
quantum of momentum k and an electron with momentum k 0 —k and 
energy W 0 +h(v 0 -~v). The only possible intermediate states are: 

(a) One in which the electron has momentum k 0 and there is no 
light quantum present. The quantum of light has been absorbed 
by the electron; in the transition to the final state the electron 
emits a quantum of momentum k. 

(b) Another in which the electron has momentum —k and there are 
two light quanta with momenta k 0 and k present. The electron 
has emitted a quantum of momentum k; in the transition to the 
final state it absorbs the quantum with momentum k 0 . 

It is possible to calculate the matrix elements corresponding to each 
of the transitions and from them to calculate the differential cross- 
section for the scattering process.f Other examples of radiation pro¬ 
cesses of the second order are: 

(i) The coherent scattering of light. 

t W. Heitler, The Quantum Theory of Radiation , 2nd ed. § 16. Cf. Chap. XI. 
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(ii) The Raman effect. 

(iii) Resonance fluorescence. 

(iv) ‘Two quanta’ annihilation of positrons. 

Among processes of the third order the most important is ‘Bremsstrah- 
lung’—the emission of radiation by an electron passing through the 
field of a nucleus; others include double scattering and the creation of 
positron-electron pairs in the field of a nucleus. 

It can be shown that the application of the quantum theory of 
radiation to all of these problemsf leads to reasonable results if the 
interaction, § l5 between the radiation field and an electron is con¬ 
sidered as small and the solution only worked out to the first non¬ 
vanishing approximation in $ v If, however, the higher approximations 
in are considered the result always diverges, so that the higher 
approximations have no physical meaning; thus it is found that the 
second approximation of the interaction energy between an electron 
and the radiation field diverges even if there are no light quanta present 
in the field. This energy is known as the transverse self-energy of the 
electron. The theory we have developed cannot therefore be regarded 
as exact. Recently, however, Heitler and PengJ have shown how the 
theory may be formulated to eliminate divergencies of this type. 

50. Radiation damping 

Introduction. It is well known that in considering the motion of an 
electron in classical electrodynamics it is necessary to consider the 
reaction of its field on its motion. The case of electromagnetic radiation 
in interaction with an electron is of interest in many applications, but 
the discovery of the meson in cosmic-ray experiments and the formula¬ 
tion of meson theories of nuclear forces has emphasized anew the im¬ 
portance of considering ‘damping’ effects in radiation processes. 

50.1. Classical theory of radiation damping 

The effect of radiation damping in the classical theory of radiating 
electrons was first considered by Lorentz, who used as a model for the 
electron a small sphere charged with electricity and hence possessing 
mass on account of the energy of the electric field surrounding it. 
Lorentz showed that the retardation of the electromagnetic field inside 
an electron produces a force which is equal to 

fe^/c 3 . (79) 

This expression is only a first approximation to the correct result for 
f See p. 287 below. f See p. 281 below. 
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an electron of finite radius, being the first term of an expansion in 
ascending powers of r 0 v/c, where v is the frequency and r 0 the radius 
of the classical electron. The form of the higher terms of the series 
depends upon the nature of the structure of the electron—i.e. on the 
nature of the forces that hold the charge on the electron—and there¬ 
fore cannot be treated in the classical theory. The main reason for 
retaining the Lorentz model of the electron was that it gave rise to the 
concept that all mass is of electromagnetic origin. However, the 
discovery of the neutron has proved that an elementary particle can 
have mass which is not of electromagnetic origin and thus removed 
the main reason for retaining the finite radius of the electron in the 
Lorentz model. 

The main difficulty in returning to a model of the electron as a point 
charge is that, if we apply Maxwell ’s theory, the field in the immediate 
neighbourhood of the point electron has an infinite energy. This diffi¬ 
culty may be resolved in either of two ways. It might be possible to 
devise a scheme of electrodynamics in which the field of a point charge 
is modified for distances smaller than a certain critical radius in such 
a way that the energy of the field becomes finite; the field strength 
calculated from the modified theory would then be less than the classical 
value for distances smaller than the critical radius. A modification of 
the classical Maxwell equations of this type is the non-linear field theory 
of Born and Infeld;f because of the non-linearity the equations of this 
theory are not easily handled and can be carried over into quantum 
mechanics only with great difficulty. 

Another way out of the difficulty is to consider that the classical 
value of the field strength as measured by the force on other particles 
is valid for all distances, but the particle’s own field does not contribute 
to its rest mass. This is the basis of Dirac’s classical theory of radiating 
electrons. J Dirac retains Maxwell’s theory to describe the field right 
up to the point-singularity which represents the electron, and by sub¬ 
tracting the infinite self-energy of the point-electron from the total 
energy of the system in an invariant way obtains a relativistic expres¬ 
sion for the force of retardation of the electromagnetic field inside an 
electron which reduces to the Lorentz expression (79) in the non- 
relativistic approximation. 

Furthermore, Dirac brings forward evidence for believing that this 

t M. Bom, Proc. Roy. Soc. A, 143, 410 (1934); M. Bom and L. Infold, ibid. 144, 425 
(1034); 147, 522 (1934); 150, 141 (1935). 

X P. A. M. Dirac, ibid. 167, 148 (1938). 
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expression is exact within the limits of the classical theory. The self- 
consistent scheme of equations set up by Dirac for the calculation of 
effects due to the interaction of electrons with radiation is identical 
with the set due to Lorentz. The two theories differ, however, in their 
interpretation of the finite size of the electron; in Dirac’s theory the 
finite size reappears in a new sense, the interior of the electron being 
a region of failure, not of the equations of the electromagnetic field, but 
of some of the elementary properties of space-time. 

For instance, let us consider the case of an electron initially at rest 
being set in motion by the passage over it of a pulse of radiation of 
infinitely short duration. If the pulse is moving along the y-axis but 
is polarized in the ^-direction the electric force may be represented by 

E= (ft «(*-*), 0,0), 

where k is a constant and 8 denotes the Dirac delta function.*)* If k is 
so small that the electron acquires a velocity small in comparison with 
that of light we may use the non-relativistic equation of motion 

^ = F. ( 80 ) 


X = 


which reduces in this case to 

x/r—x = k S(£), (81) 

where t = 2 e 2 /3rac 3 , k = 3c 3 &/2e 2 . 

The solution of equation (81) is readily found to be 

( ke tlr /m (t < 0) 
l kjm (t > 0). 

Thus for positive values of t the electron is moving with constant 
velocity k/m, as we would expect. There is, however, a departure from 
the ordinary ideas of causality; although the pulse does not act on the 
electron until the time t = 0 the electron acquires an acceleration before 
that time. It would appear that the electron knew in advance that 
the pulse was coming and acquired an acceleration just sufficient to 
balance the effect of the pulse when it does act. We can surmount this 
difficulty by assuming that the electron has a finite radius of order 

ct — 2e 2 /3mc 2 

t The Dirac delta function is defined to be zero when the argument is not zero and to be 
infinite when the argument is zero in such a way that the function and the axis enclose 
unit area, i.e. 

' 8(x) = 0 (x & 0), h(x) = oo (x = 0) 


J $(») dx : 


I. 
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since the electron begins to acquire an acceleration when the pulse is 
distant from the centre of the electron by this amount. 

Suppose now that electromagnetic waves are being radiated from a 
point A and detected at a point B a distance x+y away; and that 
there is an electron at a point E on the line AB distant x from A. 
Then if a pulse is emitted by A at t = 0, the electron will start radiating 
at a time / 


and a signal will be detected at B at time 

(x+y)lc—r. 

If we denote by c' the equivalent velocity of light in the interior of 
the electron and by b the radius of the electron, we then have 

-b 2b x+y 


± + 


“T, 


whence we obtain c' — c(l—cr/26)- 1 . 

Now b is of order ct so that c' is of order 2c; that is, it is possible for a 
signal to be transmitted through the interior of an electron with a 
velocity exceeding that of light—in contradiction to our ideas of the 
nature of space and time. 

The theory proposed by Dirac thus violates the postulates of the 
theory of relativity; for all that, the theory is constructed in such a 
way as to be invariant under Lorentz transformations. 

The difficulty that the electromagnetic energy becomes infinite in the 
neighbourhood of a point charge has been removed by Pryce*)* by the 
introduction into the energy integral of an additive vector field, which 
depends on the variables of the charge alone, so as to make the integral 
finite at the point charge. This additional field may be regarded as 
a negatively infinite energy having its origin in the charge itself. The 
procedures adopted by Dirac and Pryce are different, but both consist 
in adding terms to the usual expression for the field energy which make 
the total field energy of a point charge finite. That this procedure is 
physically sensible and not merely a mathematical device was suggested 
by Bhabha and Corben,J who extended the method to give the com¬ 
plete classical theory of a spinning particle moving in an electromagnetic 
field. 


t M. H. L. Pryce, Proc . Roy . Soc. A, 168, 389 (1938); see also G. Wentzel, Zeite.f. 
Phyaik, 86, 479, 635 (1933); 87, 726 (1934). 
t H.J. Bhabha and H. C. Corben, Proc. Roy. Soc . A, 178, 273 (1941). 
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Dirac’s theory leads to the non-relativistic equations of motion 

= F , (82) 

where F denotes the external force and dots denote differentiation with 
respect to t. These equations differ from the usual Newtonian equations 
of motion mv = F 

in that they involve differential coefficients of the third order. To solve 
the system of equations completely for the space coordinates (x,y,z), 
we need to know their initial values and those of v and v, thus intro¬ 
ducing more arbitrary constants into the solution than are necessary 
to fix the actual motion. The solutions of the equations of motion fall 
into two types—physical and non-physical solutions—which must be 
distinguished. In solving particular problems the procedure is to write 
down all the solutions of the equations (82) and then determine which 
of them are physically possible. For example, in the case of an electron 
moving in the absence of an external field, there are two solutions! of 
the relativistic equations, one in which the velocity of the electron 
tends to that of light and a second in which the electron moves with 
uniform velocity; the former solution is non-physical, the latter physical. 
The solutions corresponding to the motion of the electron of the 
hydrogen atom have been discussed by Eliezer, J who shows that there 
is no solution in which the electron falls into the nucleus. Eliezer has 
also considered the effect of radiation damping on the motion in a plane 
of an electron in a uniform magnetic field§; the physical motion is one 
in which the electron spirals round inwards until it comes to rest, but 
there is also a non-physical solution in which the electron spirals out¬ 
wards with increasing velocity. 

As an example of the use of equations (82) we shall consider the effect 
of radiation damping on the scattering of light by a free electron. 
A light wave with the frequency v and the field strength 

E = E 0 e 2irivt 

may fall upon a free electron which is initially at rest. The equation 
of motion (82) then becomes 

mx—yx = eE 0 e 2nivi , (83) 

where, for convenience, we have written 

y = 2e a /3c 8 . (84) 

t P. A. M. Dirac, loc. cit., p. 156. 

t C. J. Eliezer, Proc. Ocmb. Phil. Soc. 39, 173 (1943). § Ibid. 42, 40 (1946). 
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Equation (83) has the periodic solution 

x = x 0 e*"* w , 

where — m(27rv) 2 x 0 +i(27w) z yx 0 

v c ®o 


eE 0 


The total energy radiated per second is now 

1 


where 


2|v|* _2e_ 2 _ 

3 c 3 ~3c 3 ' 01 m 2 (l+ic 2 )’ 

27 ryv 47re 2 i/ 


m 3mc 3 

and the total cross-sectionf for scattering is thus 

8tt/ e 2 


* 


V— 

1 1 +» 


3 \mc 2 l+« 2 


(85) 


50.2. Quantum theory of radiation damping 

The incorporation into the quantum theory of the classical methods 
of Lorentz and Dirao has recently been discussed by several authors. 
The first quantum treatment of the effect of radiation damping on the 
scattering of light by free electrons was given by Waller, J but it was 
only by ignoring the initial conditions which the solutions of the per¬ 
turbation equations must satisfy that he was able to derive a solution 
of the equations. Waller’s theory was developed further by Gora,§ 
who overcame the difficulty arising from the initial conditions by 
assuming that the interaction, H v between an electron and the radiation 
field, is switched on adiabatically. Thus we assume that H x = 0 initially 
and is switched on over a time period large compared with the period 
of the light wave. Gora’s results are also incorrect because of the 
omission of some of the terms in Waller’s equations. The correct 
equations to include the effect of radiation damping for the special 
case of scattering were derived simultaneously by Heitler|| and Wilsonf f 
by entirely independent methods; Heitler adopted Gora’s assumption 
of adiabatic switching on of the perturbation energy, and Wilson em¬ 
ployed a limiting process to eliminate the terms which make the 
solution of the problem difficult. A divergence-free quantum theory 
of radiation including the effects of radiation damping was formulated 

t Cf. H. Casimir, Zeits, f. Physik, 81, 496 (1933). 

t I. Waller, ibid. 88, 436 (1934). 

§ E. Gora, Acta Phya. Polonica , 7, 169, 374 (1938-9). 

|| W. Heitler, Proc. Comb . Phil. Soc. 37, 291 (1941). 
ft A. H. Wilson, ibid. 37, 301 (1941). 
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subsequently by Heitler and Peng.f The theory proposed by these 
authors is frankly heuristic; they arrive at a divergenoe-free formation 
by simply omitting the divergent integrals from the original theory. 
This formalism is then shown to be Lorentz invariant. A general pro¬ 
cedure for this omission has so far only been formulated for one class 
of problems, the calculation of transition probabilities. Despite the 
heuristic basis of the Heitler-Peng theory its authors claim that, in view 
of its close agreement with experimental results, it will form a correct 
part of future quantum electrodynamics or a good approximation, 
having perhaps a similar relation to future quantum theory as the 
Bohr theory had to wave mechanics. 

Independently of these investigations DiracJ made an attempt to 
produce a new method of field quantization which would remove the 
divergence difficulties from quantum electrodynamics. By introducing 
the idea of quanta of negative energy and making use of a certain 
limiting process, Dirac showed that in his theory the integrals corre¬ 
sponding to the divergent integrals of the previous theories are con¬ 
vergent. Dirac's theory has certain features in common with the 
Heitler-Peng theory; in the limit of quanta of low energy both theories 
reduce to the classical theory of radiating electrons due to Dirac.§ The 
relation between the two theories has been discussed by Gormley and 
Heitler,|| who find that, contrary to expectation, the Heitler-Peng 
theory is not a consequence of Dirac’s quantum electrodynamics. It is 
found that for processes involving the scattering of high-energy quanta 
the two theories give rise to very different results, and that, in certain 
cases, Dirac’s theory has no solution at all whereas the Heitler-Peng 
theory has. This last difficulty arises from the quanta of negative 
energy, and it would seem that this part, at least, of Dirac’s theory 
has to be abandoned. 

No completely satisfactory basis for the subtraction process employed 
by Heitler and Peng has yet been found, but numerous applications 
have led to results in agreement with experiment. As this theory is 
discussed at some length in the last chapter of Heitler’s bookff we shall, 
in the following sections, mainly follow the method of treating radiation 
damping developed by Wilson. The connexion of this method with the 

t W. Heitler and H. W. Peng, ibid. 38, 296 (1942). 

j P. A. M. Dirac, Proc . Roy . Soc. A, 180,1 (1942); Comm. Dublin Inst. Adv. Studies, A, 
No. 1 (1943). Cf. W. Pauli, Rev. Mod . Phys . 15, 176 (1943). 

§ Cf. last section. 

|| P. G. Gormley and W. Heitler, Proc . Roy. Irish Acad . 50, A, 29 (1944). 
tf W. Heitler, The Quantum Theory of Radiation , 2nd ed. (Oxford, 1942). 
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physical concept of 'damping' is not immediately obvious; it would 
seem that Wilson’s method consists merely of a more accurate solution 
of the wave equation for a Hamiltonian involving the electron and the 
radiation field. The relation of this process to radiation damping is 
purely formal—the mathematical analysis corresponding closely to the 
subtraction method of Dirac for the classical theory. 


50.3. Solution of the radiation equations for direct transitions 
We first of all consider the case in which the transition between the 
initial and final states takes place directly, and not through inter¬ 
mediate states. Let n 0 denote the initial state of the system, and n 
one of the possible final states of the system in which a quantum of 
radiation of any frequency, polarization, and direction of propagation 
has been emitted, such that the energy of the complete system is 
approximately conserved. As in §48.3 we have to solve the wave 
equation 

(So= 

of the coupled system where $ 0 is the sum of the Hamiltonians of the 
pure radiation field and of an electron moving in a field free from 
radiation and is the energy of interaction between the electron and 
the radiation field. We assume again a solution of the form (63) where 
the Y*., W k are the eigenfunctions and eigenvalues of the Schrodinger 
equation 

It follows immediately from equation (64) that the perturbation equa¬ 
tions for the probability amplitudes b no (t ), b n (t) are 

K(t) = 12 KISxl \n)b n (t)e<W-^\ (86) 


kV) = 

(87) 

The initial conditions satisfied by the probability amplitudes are 

M(0) = 1, MO) - 0. (88) 

If we write the solutions of equations (86) and (87) in the form 


&«,(*) = <rV ‘> M<) = U n 


Wn t — B»+% 


(88 a) 


we see that the initial conditions (88) are automatically satisfied. The 
function U n is a function of the variables defining the final state; it is 
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assumed to be a slowly varying function of W n which is neither infinite 
nor zero when W n = W nt . If we substitute for b n> , b n , in equation (87) 
we obtain immediately 




W n -W n .+My \ 


As before we replace the summation by an integral over W n . Denoting 
by p Wn doj n dW n the number of states with energy between W n and 
W n ~\-dW n for which the quantum of radiation is scattered into the solid 
angle dw n , the sum is then replaced by 

C l—e-yi+WVn'-WnW 

\ AM? I A W TT / l I ./ V 7 JTTT 








IWn^W* o 

Substituting for the sum in (89) and for b n {t) by its value (88 a) we 
obtain finally 

u n = (nl^ilnoJ+iwf f (nl^ln'jU^pjr duJ . (90) 

Equation (90) is an integral equation for the determination of the 
function U n . 

We have next to determine the parameter y. Substituting from 
equation (88 a) into equation (86) we obtain 

i ' ST / I- | vt-t 1— eYt+Wn-Wnjiih 

Replacing the sum by an integral as before we have 


Y — ft j ( w ol$il n )^7iPu r „^ a> ?i 


As an example of the solution of equation (90) we consider the 
scattering of light by free electrons. If we denote by n 0 the initial state 
of the whole system (i.e. electron-f-quantum) and its final state by n, 
then the matrix element (w 0 l$il w ) is given by equation (49 a). Substi¬ 
tuting the Schrodinger wave functions for a free electron into (49 a) 
we obtain * _ 2 

KISi \n) = ~--(e 0 e), 

mv 

where, as before, e 0 and e denote unit vectors in the initial and final 
directions of the polarization respectively. As a trial solution we put 

U n = C/(e 0 e), 
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then 

where 


K 



mv 


e 2 v j ( e ° 
2 me 3 


I 

= 1 —%K 9 

e')(e'e) d<o' 


Now it is readily shown that 

J (e 0 e')(e'e)<fa»' = y(e 0 e), 


so that k = 47re 2 i'/3mc 3 . Integrating over all directions of the emitted 
quantum and summing over the two polarizations we obtain the 

formula = 8W e 2 \ 2 1 

3 \mc 2 / 1-f* 2 


4> 


(85 a) 


for the total cross-section of scattering, in agreement with the classical 
equation (85). 


50.4. Transitions through intermediate states 

If in considering the scattering of light by free electrons we use 
Dirac’s relativistic equation for the motion of an electron, we find that 
the transition between the initial and final states cannot take place 
directly but must take place through intermediate states. We shall 
now derive the integral equations corresponding to equation (90) for 
transitions of this type. As before, let us denote the initial and final 
states by n 0 and n; and let us denote an intermediate state attainable 
from the ground state of the atom by the absorption of a light quantum 
with a given frequency, polarization, and direction by m. The per¬ 
turbation equations then take the form 

-ifiin.it) = 2 ("olSi I(92) 

m 

-ifiijt) = 1 (m\^ 1 \n)b n (t)e^n-wjiin > (93) 

-ih6 n (t) = ^ (n\^\m)b m (t)e^-^. * (94) 

m 

In these equations W no and W n are approximately equal, but may be 
very different from W m) the energy of the intermediate state; the inter¬ 
mediate states are such that momentum is approximately conserved. 
The probability amplitudes b n ^t), b n (t) again satisfy the initial condi¬ 
tions (88) and in addition 6 m (0) = 0. Substituting from equations (88 a), 
the solution of (93) satisfying this initial condition is found to be 

MO - {(-IftW+HJ WfcW 

( 95 ) 
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Substituting from (88 a) and (95) in (94) we obtain 
U n e-vt+HK,-wM* 

= 2 WSiM(M$>iI»o)+* 4 f (m\^i\n')U n .p Wn d(oJ ] X 
TO ( l J J fPn’^Wn,) 

e-yt+W^-WnlllK _ e i(JV m -WJllh 

X- w — W JCh -• ( 96 ) 

W 7lf >—W m +ihy 


Let us now consider in particular the scattering of light by a free 
electron. If a free electron absorbs a quantum the resulting inter¬ 
mediate state m 0 is sharply defined by the law of conservation of 
momentum and the sum with respect to m reduces to a single term— 
that with m replaced by m 0 . The resulting equation has no solution 
since there is no term on the left which we can equate to the term with 
factor e i(Vf ^- w ^ n on the right. Instead of considering the electron to 
be free we consider it to be bound, the binding energy of the electron 
being small in comparison with the energy of the quantum of radia¬ 
tion. There is then a continuum of intermediate states and it is possible 
to make the corresponding terms involving e^ Wm ~ fV ^ h on the right 
of equation (90) give a zero contribution to the final result. To 
demonstrate this we have merely to observe that the matrix elements 
(w 0 ||)M a re not delta functions—although (w 0 l-dil m o) is—so that the 
summation with respect to m gives zero for sufficiently large values of t 9 
since e^ Wm ~ w ^ h is a rapidly varying quantity and the denominator 
does not vanish at any point where the matrix elements are not zero. 
In this way we can equate the coefficients of 0 -wjtin and then 

pass to the limit of a free electron by replacing the sum by a single 
term. In this way we obtain the equation 


W5iK)K)lSifoo) 

w n 0 


(w|Si|to 0 )[J (m 0 l$il»O£W>*v do> n /J 


-Wmo+My 


for the determination of the unknown function U n . 

This procedure gives us the formulae for the scattering of radiation 
by a free electron initially at rest; by applying a Lorentz transformation 
we obtain the results for the case in which the electron is in motion 


originally. The method outlined above seems at first sight rather 
artificial, but, as remarked by Wilson, it is in fact closely analogous 
to the procedure by which the scattering of an electron by a Coulomb 
field is derived from Bom’s scattering formula by calculating the scat¬ 
tering produced by a screened nucleus and then proceeding to the limit 
of zero screening. 



286 


INTERACTION OF RADIATION WITH MATTER 


§60 


To determine the parameter y of equation (97) we substitute from 
equations (88 a) and (95) into equation (92) to obtain 

ihye-v 1 = ^ {(w|§ 1 |» 0 )+i7r[J (m\§i\n)U n p Wn J (n 0 |§i |m) X 

w e -vt—e<)W m -w^ih 

x K.~w m +My' 

Eliminating the terms involving the factor and proceeding 

to the limit of zero binding we obtain 

ihy r r 1 

Ift u w. la i™ ^ ("ol&K) (™o\5>i\n')U n .p W ' il dw n .\ 

~ W n -W mt +ihy + W n -W m ,+My 

If there is more than one intermediate state in the case of zero 
binding the analysis proceeds along similar lines. 

The y 9 s in the denominators of equation (97) can usually be neglected 
and we obtain an integral equation of the type 

V. - PJ+.V j K{n,n’)U, fw..i*«, <»*> 

where ™ = V («IS, WW&KI (99) 


TJO _ V (w|SlNKw||>lK) 

“ i w n - w m 
*(»,»■) - Y WS.WWS.K), 


It is possible to investigate the influence of radiation damping on 
scattering processes by a more conventional method, namely, the 
stationary method of the perturbation theory. Ma and Hsiieh have 
shownf that this method gives rise to the integral equations derived 
by Heitler and Wilson for two-stage transitions when the effect of 
radiation damping is included in the analysis. 


50.5. Solutions of the integral equations 

Because of the complicated nature of the kernel of the integral 
equation it seems hopeless to attempt to derive a general solution. 
Some progress has, however, been made with the solution of the integral 
equations for certain special problems. The influence of radiation 
damping on the formula for the scattering of radiation by free electrons 
has been investigated by Wilson, { using Dirac’s relativistic wave 
equation for the electron. Wilson considers the scattering in the frame 
of reference in which the total linear momentum of the system is zero; 

t S. T. Ma and C. F. Haueh, Proc. Camb. Phil . Soc. 40 , 168 (1944). See also W. Pauli, 
Meson Theory of Nuclear Forces, chapter iv (New York, 1946). 

X A. H. Wilson, loc. oit., p. 307. 
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the kernel does not then depend too strongly on the angle of scattering 
and we can replace it by an average value. Thus equation (98) is 
replaced by the equation 

CTJ1+4.W5*) = Ul 

where the bar denotes a mean value of the nature of a root mean 
square. Hence , ™, 2 

| KJ 2 =-—. 

With this approximation Wilson obtains the general formula 


__ 

9 l+^o PW 


( 101 ) 


for the differential cross-section, d<t> , in terms of the differential cross- 
section for zero-damping, dcf> 0 \ the initial momenta of the light quantum 
and the electron are p and — p. This formula is not exact; the second 
term in the denominator should be multiplied by a numerical factor 
of order unity to improve the approximate method of averaging. 

By means of this formula Wilson shows that the damping is given 
correctly by the classical theory for low frequencies; however, at high 
frequencies (hv > 137mc 2 ) he finds that, contrary to classical expecta¬ 
tion, the influence of damping on the cross-section is always negligible.f 
Since all applications of radiation theory to high-energy problems of 
the type arising in the theory of cosmic rays depend on the smallness 
of the damping effect, this result is of considerable importance. Because 
of this, and the fact that Wilson’s procedure of averaging under the 
integral sign may not lead to correct results, the problem has been 
re-examined recently by Power expanding the solution of the integral 
equation in ascending powers of e 2 , Power shows that the extra term, 
due to damping, in the Klein-Nishina formula is small in comparison 
with the main term, thus confirming Wilson’s result. 

Approximate solutions of equations (90) and (97) have also been 
discussed by Hsueh and Ma§, by means of a variational method. 
Separating the summation over the directions of polarization from the 
other integration they write equation (90) in the form 


U n = (»ISiK)+*" 2 I (*l$ll»') U n 'P W „dw n ; 

Pn’ J 


f The formula (Klein-Nishina) for the cross-section without damping is discussed 
on p. 316. 

t S. C. Power, Proc. Roy. Irish Acad. 50 , A, 139 (1945). 

§ C. F. Hsueh and S. T. Ma, Phys. ftev. 67, 303 (1945). 
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and not© that it is equivalent to the equation 

2 2 f f (»l$il n'Wn'PnduJpn^ = 0, 

Pi* Pn J L Pn’ J J 

where 817* denotes an arbitrary variation of the complex conjugate 
of U n . Substituting 

u n = «(»!$! K), 

where a is a parameter independent of n and n 0 and varying U*, we 
have 

8ot*22 f KISiMx 

Pn 0 Pn J 

X J(^|Sll^o)( a 1) ^“2 J ( n \$>l\ n ')( n ~ 

Solving this equation for ol we obtain 

1 


where 

*2 2 2 f f (^ 0 1^11^)(^ ISl 1^' )(^' 1^1 l^o)pn^ 

K __ Pi to Pn Pn> J J _ 

2 2 f f 

Pn o Pn J J 

The formula for the cross-section is now 


d<f> 


1 d<f> 0 


d(f> 0 

1+K 2 ’ 


( 102 ) 


This formula is of the same form as Wilson’s result (101), but the 
damping factors in the two formulae are different. To test the validity 
of formula (102) it is of interest to use it to calculate the cross-section 
for the non-relativistic scattering of a quantum of radiation by an 
electron at rest and then to compare the result with the exact solution 
(85 a). Now from the equation 

HSiK) = ^W) 

mv 

we have that 

- e u y 2 212 ff (e 0 e)(ee')(e' e 0 )dw n dw n . 

H » _ ________ __ Cg € 6* » " 


mv hc a 


and equation (102) then gives 


22 f (e 0 e)(ee 0 )dw n 

e 0 e J 


jj __ (e 0 e) 

71 mv 1 —ik 


in agreement with the exact result (85 a). 
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We have seen that the effects of radiation damping are unimportant 
for processes involving free electrons. On the other hand, it was found 
by Heitler and Wilson that damping effects are appreciable in all 
processes involving mesons. Since then there have been various investi¬ 
gations of the effect of damping in problems on the motion of mesons. 
Relativistic formulae for the scattering of mesons by nuclear particles 
under the influence of radiation damping have been derived by Ma and 
Hsueh.f The theory of radiation damping has also been applied to the 
creation of mesons by various processes. It was shown by Heitler and 
PengJ that in a collision between a fast nuclear particle and another 
nuclear particle at rest mesons are emitted with a very high probability; 
the production of mesons by light quanta has also been investigated^ 
These papers form the basis of a theory!, J which gives a satisfactory 
account of all the chief cosmic-ray phenomena associated with mesons, 
including their creation, their diffusion through the atmosphere, meson 
showers, and the transformation of charged mesons into neutral mesons; 
it is assumed that the mesons are produced by a primary radiation of 
protons. The agreement with the experimental results is sufficiently 
close to make it seem probable that the theory of cosmic-ray mesons 
based on the quantum theory of radiation damping is fundamentally 
correct. 

t S. T. Ma, Proc. Camb. Phil . Soc. 39 , 168 (1943); S. T. Ma and C. F. Hsueh, ibid. 
40 , 167 (1944). 

X W. Heitler and H. W. Peng, Proc. Boy. Irish Acad- . 49 , A, 101 (1943); W. Heitler, 
ibid. 50 , A, 155 (1945); H. W. Peng, ibid. 49 , A, 245 (1944). 

§ J. Hamilton and H. W. Peng, ibid. 197 (1944); J. Hamilton, W. Heitler, and 
H. W. Peng, Phys. Bev. 64 , 78 (1943). 
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XI 

RELATIVISTIC QUANTUM MECHANICS 

51. Introduction 

The theory of the spinning electron developed by Pauli and Darwin 
(Chap. IV) on the basis of the Schrodinger wave mechanics is incom¬ 
plete in two important respects. Though it provides a suitable mathe¬ 
matical scheme for the description of many of the properties of the 
electron’s spin deduced from spectroscopic observations, it gives no 
satisfactory account of its origin. Furthermore, no attempt was made 
to take account of the special theory of relativity, so that the theory 
cannot be applied to the discussion of problems involving electrons of 
high energy; and for many problems, such as the fine structure of the 
spectral lines of hydrogen, the corrections introduced by spin and by 
relativity are of the same order. Before the introduction of the concept 
of electron spin into wave mechanics various attemptsf were made to 
establish a wave equation of the Schrodinger type invariant under 
Lorentz transformations. Unlike the non-relativistic Schrodinger 
equation, these equations were of the second order in the time- 
differential operator d/dt. Dirac solved both problems, that of con¬ 
structing a relativistically invariant wave equation and that of 
explaining the origin of the electron spin, by observing that if the 
general interpretation of non-relativistic wave mechanics is to be pre¬ 
served, the relativistic wave equation must be linear in d/dt . The 
beauty of Dirac’s method lies in the fact that once a Lorentz-invariant 
wave equation, linear in the operator d/dt , has been constructed, the 
magnetic moment assumed in the model of the spinning electron pro- 
, posed by Uhlenbeck and Goudsmit appears as the result of a simple 
calculation without the addition of a new physical assumption. Dirac’s 
theory of the electron has met with conspicuous success in its applica¬ 
tion to the fine structure of atomic spectra, the scattering of fast 
electrons and of hard y-rays, and the formation of pairs of electrons 
and positrons. The agreement with the observed experimental results 
is so good that it appears that Dirac’s equation for a single electron is 
essentially correct. On the other hand, there is considerable doubt as 
to the validity of the extension of the theory to two electrons. 

Before considering in detail the possible forms of the relativistic 

t E. SohrOdinger, Ann. der Physik , 81 , 109 (1926); O. Klein, Zeits.f. Physik , 37 , 895 
(1926); V. Fock, ibid. 38 , 242 (1926); ibid. 39 , 226 (1926). 
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wave equation for an electron it will be of advantage to state the general 
conditions that we should expect such an equation to satisfy. The 
equation should satisfy the requirements of the special theory of rela¬ 
tivity, that is to say, that it should be invariant under the group of 
Lorentz transformations. There is another kind of invariance which 
the equation must possess; this is 4 gauge-in variance’, the importance 
of which was realized first by Hermann Weyl.f In classical electro¬ 
dynamics the field strengths E and H are related to the scalar and 
vector potentials <f> and A by the formulae 

H “ curl A, E = — grad<£ — - —, 

8 y c dt’ 

div A-J-i — = 0. 
dt 

The transformation 

= <£'+ 1 A = A' — 1 grad A, 

ec dt e 


where A is a solution of the wave equation 


c* dt‘ ’ 


leaves the form of the above equations unaltered and is called a gauge 
transformation. 

We should also expect to be able to form a 4-vector representing 
current and charge density and to prove by means of the wave equa¬ 
tion that the current satisfies a continuity equation both in the presence 
and in the absence of an external electromagnetic field. (The analogous 
case in the non-relativistic theory is given on p. 259.) Finally, we 
should be able to predict the existence of the spin and of the magnetic 
moment of the electron from the wave equation. 

At first sight it would appear that it is necessary for the theory to give 
only positive values for the energy of the particle. We shall see that 
negative energy states of the electron arise in Dirac’s theory but that 
it is possible to make use of them to account for observed properties 
of positrons (positively charged particles with the electronic charge 
and mass). Though Dirac’s theory had this unexpected triumph of 
identifying the positron with a ‘hole’ in the aggregate of negative 
energy states, such a hypothesis, strictly speaking, removes from the 
theory its original basis as the theory of a single electron. Indeed, if 

t H. Weyl, Oruppentheorie und Quantenmechanik (Leipzig, 1928). 
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we wish to obtain valid relativistic interpretations we cannot even 
properly formulate the problem of a single particle; we must instead 
study the many-particle problem from the beginning. In the many- 
particle problem in which the particles are considered to be moving 
at very high speeds the charge density —eP need not be negative (in 
the algebraic sense) because of the possibility of the creation and 
annihilation of positrons. In other words, the probability function P 
need not always be positive in the many-electron problem. 


52. Scalar relativistic wave equations 
52.1. The Hamiltonian for the relativistic motion of an electron 
In this section a derivation will be given of the Hamiltonian of a 
charged particle moving in an electromagnetic field. Consider the 
motion of an electron of rest mass m and charge — e in an arbitrary 
electromagnetic field whose electric and magnetic vectors are denoted 
by E and H respectively. When the velocity of the electron is v its 
momentum is given by mv 

so that the equations of motion are 


d I 7RX 

dt \^(1 —v 2 /c 2 ) 



-(vaH) = 0. 

C 


(i) 


Now if we denote the scalar and vector potentials of the field by <f> 


and A, then 

E = —grad<4 —H = curlA, 
c dt 


and the vector equation of motion of the electron (1) reduces to 
d I mv \ , . * e dA e t A . A . 

*W=m ) = ,8ri4+ ;ir-; |,An ' IA) ' 

The equations of motion (2) may be written in the Lagrangian form 

d /&L\ dL 
dt\dqj dq 

if the Lagrangian function L is taken to bo 


( 2 ) 




L~ — mc 2 ^j{\—v 2 /c 2 )+e<f> — ~(v.A). (3) 

c 


This may easily be verified, 
the relations 


The conjugate momenta are defined by 


Pi 


8L 

Hi 
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so that for the dynamical system described by the Lagrangian function 
(3) the momenta are p x , p uy p zi where the vector p is given in terms 
of the vectors v and A by the equation 

mv e A 

P ~ ^(l-V 2 fc 2 )~C ' ( ) 


The general expression II — 2 ViPi~~^ 

i 

for the Hamiltonian of a system whose Lagrangian function is L, 
reduces for a single particle to the simple form 

H = (p.y)—L. (5) 

Substituting from equations (3) and (4) into equation (5) we obtain 

me 2 


II 


- exf >. 


V(1— , y 2 /c 2 ) 

Solving equation (4) for v and substituting in this last equation we 
find that in terms of the momentum the Hamiltonian of a single particle 
moving at speeds comparable with that of light is 

H = Cy/{m 2 c 2 -f (p+e A/c ) 2 }—. (6) 

For a conservative system the Hamiltonian, H , is equal to the energy 
of the system, which is denoted in this book by W. Writing p 0 — IF jc, 
we see that equation (6) may be written in the symmetrical form 

€<f) 


~{po + jj +( P + v) + m2 ° 2 


= 0. 


(7) 


52.2. Approximate relativistic equations 

The difficulty in setting up the relativistic Schrodinger equation 
directly from the Hamiltonian (6) arises from the interpretation of the 
square root. Approximate equations can, however, be derived by 
expanding the Hamiltonian H defined by equation (6) above in terms 
of v/c. In this way we obtain as a first approximation (cf. Chap. I, § 8) 

e 


H — me 2 A- 


_L| p+ ! a ) _ 


e<f). 


( 8 ) 


In the second approximation we replace equation (8) by 

\2 l 


H 


mc 2 +--^-(p + -A] 
~2m\ ~c / 


8 m 3 c 2 




If in this equation we write H = W'-j-mc 2 and replace (p+cA/c) 4 /8m 3 c 2 
by its first approximation (W’+e<f>) 2 /2mc 2 given by equation (8), we 
find that to the order v 2 jc 2 

eA\ 2 ,„ r( . , v 1 


JLfp+^y 

2 m\ v ^ c ) 


(W+e<l>)-~(W'+e<l>) 2 = 0. 


(9) 
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The Schrodinger equations corresponding to these Hamiltonians are 
obtained in the usual way by making the substitutions 

b d 

~idt ’ P 


H 


i(d_ d_ d\ 
\dx dy dzj 


-grad 


( 10 ) 


in equations (8) and (9). This procedure applied to (8) yields the 
equation 

which may be reduced to the form 


r‘grad + ^}*F + (mc 2 -e<f>)Y = 0, 
* Cl l ot 


ft 2 <1 of a 

■ V 2 T + - {A grad T + |T div A} +1 ^ - 

JiTtl 7YIC 


e 2 A 2 \ 
2mc 2 j 




h d*Y 

i ^ v ' 

In this equation it should be noted that div A does not vanish unless 
the scalar potential is constant with time, since </> and A are related 


by the equation 


div A-f 


1 3<f> 
c dt 


The equation (11) is correct only to the order vjc, where v is tlie velocity 
of the electron. 

It will be noticed that equation (11) differs from the non-relativistic 
equation used in § 8 only in the small term in A 2 , which can be neglected 
in most applications, and in the term mc 2 Y, which can be removed by 
transforming to the new wave-function 

vp' _ e imcHlh\^r 

If we multiply equation (11) by T*, and its complex conjugate by 
X F and subtract, we obtain an equation which may be written in the 
form of a continuity equation 

I’+divj = 0, (12) 


if p and j are taken to be defined by 
p = —eTT* 

pfj p2 

j = —( l F*gradY—T gradT*) - —-ATY* 
2im me 


(13) 


Similar results may be obtained from equation (8) giving a better 
approximation. 
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52.3. The Klein-Gordon Equation 

It was suggested by Gordonf that the exact relativistic wave equa¬ 
tion should be established by making the substitutions (10), not in 
equation (6), but in the more symmetrical equation (7) and allowing the 
operator so formed to act on a wave-function Y. In this way we obtain 
the relativistic wave equation 


^ grad -f- —) - 


.pi_^\ 2 + m 2c 2 W= 0. 

\icdt c) ^ / 


This equation is known as the Klein-Gordon equation. 

The Klein-Gordon equation involves us in serious difficulties in 
interpretation which arise because it is not linear in the time-differential 
operator d/dt. In non-relativistic quantum mechanics the Schrodinger 

(n+$$r — 

is linear in d/dt. It follows that if Y is known initially, the equation will 
determine it at all subsequent times. The results of any measurements 
on the position and momentum of a particle are used in forming the 
wave-function Y; the correct initial condition is thus that Y should 
bo known, not Y and d'Y/dt. With the Klein-Gordon equation, on the 
other hand, it would be necessary to know both Y and SV/dt in order 
to determine Y at all subsequent times. 

The second difficulty arises when we attempt to find an expression 
for the current and charge density. Multiplying equation (14) by Y*, 
the complex conjugate equation by Y, and subtracting we find that 
the resulting equation may be written in the form (12), where now 

D eh f^aY _dY*) e 2 ^ 

p= - ep - 

, , ( is ) 

j = ™.{T*gradT-TgradY*}--- AW* 

ZiTiVt me j 

These expressions for the charge density and the current density are 

due to Gordon and Klein .J The difficulty arises from the interpretation 

of the quantity P defined by the first equation of (15). In the non- 

relativistic Schrodinger theory P would be the probability function for 

the position of an electron. Such a probability function satisfies two 

general conditions: 

(i) P>0, (ii) |Jpdr = 0, 

t W. Gordon, Zeits.f. Physik, 40 , 117 (1926). 
t W. Gordon, ibid. 40 , 121 (1920); O. Klein, ibid. 41 , 414 (1927). 
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the integral being taken over all space. In a completely satisfactory 
relativistic theory we should expect P to satisfy both of these conditions 
together with the further condition that 

jPir 

is invariant under Lorentz transformations. We have already shown 
that the condition (ii) is satisfied by showing that p satisfies a continuity 
equation of the type (12). The third condition is satisfied automatically 
if it can be shown that p may be expressed as the fourth component 
of a 4-vector. This is readily done by considering the transformation 
properties of the set of four quantities 

fy —(ix»j V ’iz’ ic p)- ( 16 ) 

The condition (i), however, is not satisfied, since 

,0T* 


« (yyi 

2 mcH \ 




dt) 


me 2 


and this can be made negative, both Y and dY/dt being arbitrary at 
the initial point in time. P is not therefore a satisfactory expression 
to replace the probability function YY* of the non-relativistic theory. 

Although the expression for P given by equation (15) seems at first 
sight to be totally different from the non-relativistic formula Y l F*, it 
can easily be seen that the formula derived from the Klein-Gordon 
equation reduces to that given by the Schrodinger equation when the 
kinetic energy of the particle is very much smaller than its rest energy. 
For if we substitute 

WY 


hSY 

i dt 


in the first of equations (15), we obtain 


II/W* 

me* 


From the fact that W 


W'+mc 2 it follows immediately that 
W'+ecf> \ 


P = 'FT*| 1 


me* 


r 


(17) 


(18) 


If now the kinetic energy is assumed to be small in comparison with 
the rest energy me 2 we find that P is approximately equal to YY* as 
in the non-relativistic theory. 


53. The Dirac wave equation for an electron 
53.1. The Dirac equation for an arbitrary field 
The considerations of the last section show that the correct relativistic 
wave equation must be linear in the differential operator d/dt; further- 
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more, in conformity with the special theory of relativity, the equation 
must be invariant under a Lorentz transformation. In the theory of 
relativity there is complete symmetry between the space coordinates 
x , ?/, z and the time coordinate ict , so that if our equation is linear in 
d/dt it must also be linear in d/dx, d/dy, and d/dz. For this reason 
Diracf took the relativistic wave equation to be of the form 

0 = {(*»<»+ C t) ++ ° c “) + a " [ Pu + T*) ++ T 5 ) +P } % 

(19) 

where, as in the last section, <f> denotes the scalar potential and 
(A x ,A y <A z ) the vector potential; the operators p x ,p y , p~ are defined by 
equation (10) and j) 0 denotes the operator In equation (19) 

the quantities a x , oc ir ol z , /? are not ordinary numbers; their nature will be 
defined below. To determine the laws governing these quantities Dirac 
made the additional assumption that for an electron in the absence of 
a field the wave equation (19) would reduce to the Klein-Gordon equa¬ 
tion (14). In the absence of an electromagnetic field both <j> and A 
vanish and equation (19) reduces to 

(Po+oc x p x +<x u p y +<x s p s +p) y Y = °- 
Multiplying this equation by the operator 

-Po+ <*xPx+ <*vPv+* S Pz~ 
we obtain the second-order wave equation 

{— Po+ 2 °%Pl+ 2 («X a ! /+“j l IX x)PxPv + P 2 + 2 (^xP + fax))^ = 0. 
This reduces to the Klein-Gordon equation for a free particle 

( —Pl+Pl+Pv +Pl + rn 2 c 2 ) x Y = 0, 
if the quantities oc x) cx y9 oc z , j8 obey the relations 

= Oil = Oil = 1, 0L x 0L y + 0L y 0i x = 0, etc., \ 

j3 2 = m 2 c 2 , (x x j3-\-f}<x x = 0, etc. 1 ' 

Once the quantities oc x , a y , ot B , and /J have been determined from these 
equations, the wave equation for an electron in an arbitrary electro¬ 
magnetic field can be taken to be (19). It should be noted that the 
wave-function W appearing in this equation need not be a scalar; we 
shall see later that, because of the nature of the a’s and jS, the wave- 
function T must, in fact, be assumed to have four components. 

f P. A. M. Dirac, Proc. Roy. Soc. A, 117, 610 (1928). 
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The Dirac equation (19) assumes a convenient form if we write 

“ (#i, Xfy #3, # 4 ), 

“ C^l* ^3’ ^4)> 

“*> “!/. a 2> —j = («i- “2. «a- “4). 

and put II, = k - - + ~ ' (j = 1,2,3,4). 

I' uXj C 

With this notation equation (19) may be written in the form 

[ 2 °q Ilj—ill 4 -f- a 4 me J T* = 0, (21) 

where now the quantities oq, a a , a 3 , a 4 satisfy the relations 

Ct-\OL v +OL v OLX = 2S Ai; (22) 

(A, y — 1,2,3,4). If we write the three quantities (cq, a a > « 3 ) as a vector 
a and introduce a vector operator II in the same way, we may write 
equation (21) in the form 

(a. II~ iH 4 -f a 4 m,c) x Y = 0. 

It is sometimes more convenient to write the Dirac equation (21) in 
a more symmetrical form by introducing the operators 


Pi = ®a 4 «l* ^2 = *«4 a 2> Aj = i«4 a 3> P* = a 4* (23) 

Multiplying equation (21) on the left by ia 4 and substituting from 
equation (23) we obtain the equation 

( 2 Pj Uj+mefr = 0 . (24) 


The relations (23) give the operators in terms of the operators oq; 
the inverse relations can be readily shown to be 

oq = ip 1 Pi, a 2 = ip 2 /1 4 , a 3 = ifi‘ 6 P A , a 4 = /J 4 . (25) 

53.2. The. Dirac matrices 

So far the only information we possess about the quantities cq 
(j — 1,2,3,4) is that they satisfy the equations (22). From these 
equations we now attempt to obtain a matrix representation of these 
quantities. The equations (22) are, of course, satisfied by the Pauli 
spin operators a xi a y , v z introduced on p. 97, but if we choose these 
operators as three of our a’s we find it impossible to determine the 
fourth. If, however, we extend the Pauli matrices in a diagonal manner 
thus, 

(°i> °z) — 



(26) 
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where o denotes the symbolic vector (a 
matrices 


Pi = 


x , a y , cr 3 ), and if we introduce the 



0 

0 

0 


(27) 


we find that the cr’s and p’s satisfy the relations 
<*r*» + °8°r — PrPs+PsPr = 

Pr Pr 

If now we define a set of quantities <x v by the equations 

a l ^ Pi a 2 ^ Pi °2’ a 3 ^ Pi °3> a 4 ^ Ps> 

we find that all the conditions (22) are satisfied. Performing the matrix 
multiplications we obtain for the a’s the four matrices a 4 = p 3 and 


(28) 


(29) 


oq 



0 -i 
i 
0 
0 


/« 

0 

] 

0 \ 

I 0 

0 

0 

-ll 

i 

0 

0 

0 ) 

\o 

—1 

0 

0 / 


(30) 


From equation (23) we obtain for the j8’s the four matrices /J 4 = a 4 
and 


P3 


^i = 



02 = 


0 

0 


0 

-I 

0 

0 


03 


/ 0 

0 

i 

0 \ 

1 0 

0 

0 

-i\ 

\-i 

0 

0 

0 / 

\ 0 

i 

0 

0 / 


(31) 


Furthermore, it can be proved that any four-rowed square matrix 
can be expressed as a linear combination of the sixteen matrices 

<*2> a 3> a 4> 01> 02> 03> f 

cq a a , a a a 3J a 3 aq, aqa 2 < x 3 , a a a 3 a 4 , a 3 aqa 4 , aqa 2 a 4 , aq a 2 <* 3 a 4 , 
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where / denotes the unit matrix 


( 1 0 0 0\ 
0 10 0 1 
0 0 1 0 I 
0 0 0 1/ 


It should also be noted that all these matrices are unitary and 
Hermitian.f 

The following properties of the a and p matrices are of use in the 
application of the theory to the solution of special problems. The first 
set 

P1P2 — ““Pa Pi — l p 3 

(7j <7.2 —— (Tg O j — 

follows directly from the definitions of the quantities involved. Also 
if B is any vector commuting with a, then by making use of equations 
(32) we can show that 



Substituting 


(o.B)* = B*+i 2 

1,2,3 

B == p-)--A = ~grad + - A 
ci c 


into this equation, we obtain the relation 

^o.p-f ^a| = |p + ^a| + ~ (a. curl A). 


(33) 


(34) 


53.3. The relalivist/ic invariance of the Dirac equations 
Since the quantities a and jS may be represented by matrices of four 
rows and columns it follows that the wave-function Y upon which these 
quantities operate must itself be a column matrix with four elements; 
we shall write it in the form 

Y = 



With this form of Y and the matrix representations of the /Ts given 
by the equations (31) we obtain by direct matrix multiplication 


ft 









f If A is a matrix, A' its transposed (obtained by interchanging its rows and columns), 
A - " 1 its inverse, and A* the matrix whose elements are the complex conjugates of those 
of A, then if A « A*', A is said to be Hermitian and if A = (A*')" 1 , A is said to be 
unitary: see Turnbull and Aitken, Canonical Matrices (1932), p. 33. 
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Substituting these expressions into the left-hand side of equation (24) 
we obtain the Dirac equations in the following form: 

(iK^-et+mc* )y i +cII_y 4 +cII 3 'F 3 = 0 
(in | ^+ mc^ i f 2 +c n + t 3 - cn 3 %--- o 
(ml- e <i,- m ^%+cn_%+cu a % = o 
(ih~ -e<j>- mcj T 4 +cII + 'Fi-cIT, % = 0 


(3(1) 


(36) determines the four scalar wave-functions X F 2 , x Kj, and T 4 . In 
these equations we have written II + , II_ to denote the operators II 1 ± n 2 . 

In the form (36) the Dirac wave equations are unsymmetrical and 
so it is necessary to study their behaviour under a rotation of axes or 
under a Lorentz transformation. The proof of the invariance of the 
Dirac equations under a general transformation of the type indicated 
was due originally to Dirac;f an interesting proof based on the in¬ 
finitesimal elements of the Lorentz group was given subsequently by 
Pauli4 The general formulae are complicated, but it is a simple matter 
to verify the invariance for certain special transformations. 

We consider two observers moving with constant velocity relative 
to one another, one describing his observations of the behaviour 
of an electron in an electromagnetic field by a set of coordinates 
(x v x 2i x d) Xt = id) in a space of four dimensions, the other by a similar 
set of coordinates (x^/x^x^x^ = id 1 ). Then according to the special 
theory of relativity the tw r o coordinate systems are related by the 


matrix equation 
that is, 


cox 


(37) 


= X i 


(j = 1 , 2 ,3,4), 


where the coefficients co iy - are constants. 

The first observer will represent his knowledge of the coordinates 
of the electron by a wave function of the type (35) satisfying the Dirac 
wave equation 

(.2 & IT^+mcj'F = 0 . (38) 

Writing (37) in the form x' = xw 


- i 


f P. A. M. Dirac, Proe. Roy. Soc. A, 117, 610 (1928). 
t W. Pauli, Handbuch der Phyaik , 24/1. 
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and noting that 

a ___ j> a*' 

a# ’ cte' a# ’ dy' dx dz' 
we see that the Uj transform according to the rule 

n = uj-m'. 

Now there are two possible ways in which we may discuss the trans¬ 
formation properties of the Dirac equation. We may regard the wave- 
function Y as an invariant (Y — Y') and the as the components of 
a 4-vector. The components of this 4-vector are, of course, matrices. 
If yS == (jSp/Jg, j8 3 ,/? 4 ) transforms as a 4-vector it transforms like x so 

that pn = p'axv-nv = p'iv 

and the Dirac equation is transformed to 

(2ftn;+imc)Y' = 0, (39) 

which is of the same form as equation (38). 

Alternatively, we can always find a matrix S such that 

n = - S-'U'S. (40) 

If now the jSy are regarded simply as determining the coefficients in 
the four wave equations (36) and are kept constant, they are exchange¬ 
able with 8 since they act on the inner variable only, so that 
(flf-^n'^Y+tmcY - 0. 

Multiplying on the left by S we obtain 

(pn’+imc)(8V) - 0, 

which is equivalent to equation (39) if we write 

Y' = $Y. 

This point of view must be adopted if we consider the Dirac equations 
as field equations, for then Y must be transformed to a new coordinate 
system in a manner analogous to that in which E and H transform 
in the Maxwell theory. 

The form of the matrix 8 has been determined by Darwinf for some 
special two-dimensional linear transformations; the results are tabu¬ 
lated below: 


(i) Rotation through an angle a about the z-axis 


/ coh a 

sin a 

0 

°\ 


/«!*« 

0 

0 

0 \ 

/ —sin a 

cos a 

0 

°i 

s ~ I 

0 

e~i* “ 

0 

0 I 

0 

0 

1 

°r 

l 0 

0 

e iU 

0 / 

\ 0 

0 

0 

i/ 


l 0 

0 

0 

e-* , <7 


t C. G. Darwin, Proc. Roy. Soc. A, 118 , 654 (1928). 
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S 


S — 


/ COS loL sill 1<X 

0 

0 \ 

1 — sin£< 

y. cos Ja 

0 

0 1 

1 0 

0 

cos Ja 

sin Joe 1 

\ o 

0 

—sin 

cos iotf 

-axis 

r cosh ip 

0 

sinh ip 

0 ^ 

0 

cosh Ip 

0 " - 

-sinh iP 

sinh ip 

0 

cosh ip 

0 


-sinh IP 

0 

cosh ip) 


(ii) Rotation through an angle a about the y-axis 

f cosol 0 —sin a 0\ 

0 1 0 0 ] 

i sin a 0 cos a 0 j 

0 0 0 Ij 

(iii) Uniform relative motion along the z-axis 

fl 0 0 0 

^0 10 0 

,0 0 coshjS sinhjSi 
\0 0 sinh/? cosh /?/ 

These three transformations are all that need be considered since any 
Lorentz transformation can be built up from them. The matrices S 
have the peculiar characteristic that their elements involve the angle 
^oc when the axes of coordinates are rotated through an angle a, so that 
S is somewhat analogous to the square root of to. The wave-function 
T* cannot therefore be a tensor. We thus find that the Dirac equation 
of the electron is invariant under Lorentz transformation and yet it 
could only be written in tensor form in a highly artificial manner. 
This is an example of the mathematical principle that the ordinary 
tensor formalism does not comprise all possible representations of the 
Lorentz group. We shall return to this point later (§58 below); for the 
moment we merely observe that the four wave functions can be split 
into two pairs, for instance, (T^, T 4 ) and ('K 2 ,M J ' 3 ) in case (i) above, with 
transformation equations of the form 


y; = a n %+a 12 %\ 
n = a 21 y 1+ a 22 %r 
r' 2 = <y 2 +«*yj 

T'=a*y 2 +«*T 3 r 

i a { j satisfying the relation 

(41a) 

(416) 


a n a i 2 1 j 

(42) 


i U 21 


53.4. The spin of the electron 

If in Dirac’s equation (21) we substitute for the a’s from equation 
(29), we obtain the wave equation of the electron in the form 

j^o+-^+Pi| a -P+-Aj+p 3 racj x F = 0. 

Multiplying on the left of this equation by the operator 

- (po + +Pi (® P + 1 a) + Ps mc > 


( 43 ) 
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proceeding as in §53.1, and making use of the relations (28), we obtain 
the equation 

[- (po+ P+ - c A J + ™ a c 2 + 

‘{(•• p +; A )( p *+7M ft+ 7)("- p+ ; A )). 


+M 


Y = 0. 


The expression in the curly bracket may be reduced to 

ieh( , , . 1 c>A\ . 

- - J-—(E.o), 

where E is the electric vector of the field. Substituting for (a.p+eA jc) 
from equation (34) and writing 

H = curl A 


for the magnetic vector of the field, we obtain finally the wave-equation 
[-(po+^) , + (p+^a)‘+»V+^ (o.H) + ^ Pl (o.E)]'F = 0. 


The Hamiltonian on the left-hand side of this equation differs from 
that of the Klein-Gordon equation by the addition of two terms. 
Dividing these two extra terms by 2 m we obtain an expression for the 
potential energy of the electron due to its spin. The former of the 
terms in this expression for the potential energy 


<r—(®-H) 
2 me 


shows that the electron possesses a magnetic moment of amount 
e#a/2mc due to its new degree of freedom. This is precisely the mag¬ 
netic moment assumed in the Uhlenbeck-Goudsmit model of the spin¬ 
ning electron; in Dirac’s theory the electron spin is a direct consequence 
of the form of the relativistic wave equation for the electron. 

The second additional term in the Hamiltonian shows that the 
electron behaves as though it had an electric moment iefiafemc, which, 
being a pure imaginary, does not appear in the spinning electron model. 
This imaginary quantity only appears when the Hamiltonian (43), 
which is real, is transformed in a rather artificial way so as to make 
it resemble the Hamiltonian of the Klein-Gordon theory, and does not 
appear to be capable of being incorporated into any model of the 
electron. 

We shall now determine the angular momentum operators associated 
with the Dirac equation; we shall assume that the operator representing 
the total angular momentum of the electron is the sum of the orbital 
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and the spin angular momentum operators, and that it is a constant 
of the motion; this means that it commutes with the Hamiltonian of 
the Dirac equation. If we take A to be zero in equation (43) and write 
e^/c F(r), we see that the Hamiltonian reduces to 

H = p 0 +V(r)+ Pl (a.p)+p 3 mc. (44) 

Now the orbital angular momentum L is defined by the equation 

L = TAP, (45) 

and satisfies the relations 

L xPx—Px L x = 0 ' 

L xPy~Pv L x = ih Pz • ( 4 6 ) 

L x p e -p z L x = ihp y , 

Substituting for H from equation (44) we have 

L X H-HL X - Pl {L x (o.p)-(a.p)L x } 

= Pii a • L x p—pL x } 

— ih Pl {<r 2 p 3 —a a p 2 } 

by the use of equations (46). Similar equations may be derived for the 
components L y and L s leading to the vector equation 

LH-HL = ihp x (a A p). (47) 

Similarly we have 

H—Ho 1 — Pi{o-j.(e.p)—(o.pjaj 
= ZipifaaPit—aiPs} 

by the help of equations (32). Thus we obtain the equation 

all-Ha - -2^(0 Ap). (48) 

Eliminating the vector product (oAp) between equations (47) and 
(48) we obtain the equation 

JH-HJ = 0, (49) 

where J = L+4#a. (50) 

Equation (49) establishes that J is a constant of the motion; we can 
interpret this result by postulating that the electron has a spin angular 
momentum defined by the equation 

s = 

or, in terms of the a’s, 

s x = — pia 2 a 3 , s y = — \h,ioi z oL v s z = —\Mol x a 2 , (51) 

It follows at once from these equations that 

8% == si = s\ = (P) 2 , 

x 


6078 
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in agreement with the spin postulated by Uhlenbeck and Goudsmit, 
and that the operators satisfy the formal equations 

s x s y “ s y s x == \ 

of the Pauli-Darwin theory. 

The manner in which the spin comes out in a natural way as a result 
of setting up the proper relativistic Hamiltonian is one of the major 
triumphs of Dirac’s theory. By contrast, in the Pauli-Darwin theory, 
the spin had to be added as an additional physical assumption. 


53.5. The quantization of the total angular momentum 
As a first illustration of the application of Dirac’s theory we shall 
calculate the quantization of the total angular momentum of a single 
electron. This calculation is an extension, to include the spin of the 
electron, of the method in the non-relativistic theory by which we 
classify the states of the electron in terms of the orbital angular 
momentum. The problem is to determine a wave function V F to repre¬ 
sent a state in which the square of the total angular momentum and 
one of the components of the total angular momentum are quantized; 
i.e. we wish to determine a wave function X F satisfying the equations 

= J Z Y = BY, (52) 

where A and B are constants and J is the operator corresponding to 
the total angular momentum. This is made up of the sum of L, the 
orbital momentum and s, the spin angular momentum, so that 


J = L+s. 


If, for simplicity, we write 

L ± = L x AziL y , s ± — s x ±is yi 
then J 2 = (L+s) 2 = L 2 +s 2 +(L + «s_+L_ #+.+ 2J+ tf^). 
Now in polar coordinates (cf p. 87) 


L, = - J, [ras( 8i "^)+5^4]' 

L ± - 


(53) 


We may therefore write the second of equations (52) in the form 

(iS~ + |^ia i a 2 +^j'F = 0. (52 a) 
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Now ioq a 2 y = 
so that equation (52 a) may be written as 

sj(Y v %) = 0 , 

(ihJL + lh+By^%) = 0 . 

Similarly the former of the equations (52) when expressed in terms of 
scalar wave functions takes the form 

( L 2+i«2 -hL a -A)(%,%)+hL + (%,%) - 0, 
where the operators L 2 , L ± , L z are defined by the equations (53). An 
inspection of these equations shows that they form two identical sets, 
one containing only the wave-functions H*, *E 2 and the other only Mg 
and M*. The solution of these differential equations is very similar to 
that of the equation for the determination of the angular function in 
the Schrodinger theory of the hydrogenic atom, and will not be repro¬ 
duced here. The quantized values of J 2 and J z are found to be 
A =j(j+l)fi 2 , B = mh, 

where m = and j = ■£, f,... and is such thatj ^ |m|. 

The solutions of equations (52) for given choices of m and j are readily 
found to be 

4>) 

% = F{r)^±t^ ^) + 0(r)^-±pj i Y j ^ m _ i {e, <f>) 

where Y lm (0, <j>) denotes the surface spherical harmonic 

! )*^(OM 0)e^ (-1 < m < l). 

The functions f(r), g(r), F(r), and G(r) are not determined by the 
equations (52). 



-1 0 0 0\ /%\ /—T x \ 

0 1 0 01/ % J / % \ 
oo-i oJIrJ ~~ I -r 3 r 
oooi/ W 4 / \ wj 
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53.6. Darwin's variation principle 

Darwinf has shown how to derive both the Maxwell and the Dirac 
equations from a variation principle 

88 = 8 JJJJ L dxdydzdt — 0, (55) 

where L is given by the equation 



-[-complex conjugate of this expression 


+ mr.(% T*+'F 2 Y*- T 3 'F*-T 4 'F*)+J T i 4 T 

C 1 

+-4.CF? Y 4 +Y* 'F 3 +'F 3 * 'F a +'F* T x ) 

C 

+ ?A„(-i'Y* %+iT* %-iY* 'F.+iT* %) 

c 

_j_ e 'A z {Wt X F 3 —T* T 4 +'F* ^-'F* v F 2 ) 

C 

-8^[ (<!,,rlA) '-(sI A+8ra<I *)l <“> 

The function L is considered for the purposes of the variation to be 
a function of the T’s, T*’s, A . x , A yi A z , and <f>. If we vary the function 
8 it is readily seen that the Euler-Lagrange equations yield equations 
(36) and their complex conjugates and also the equations 

( v ’-?s)*= 4 ’ re ,i,' F '' i ?’ < 57 > 

^JA X = -IneCV* Y 4 +Y 2 * %+Y* %+Y* \), (58) 

with two similar equations for A x and A y . Comparing these equations 
with the corresponding equations in the Maxwell-Lorentz theory we 
* t C. G. Darwin, Proc. Boy. Soc . A, 118 , 654 (1928). 
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find that the electromagnetic effect of the electron can be represented 
by a charge density — pe, and a current density —ej, where 

P - m 'Fi+Y 2 * %+Y* %+'Y* Y 4 ), (59) 

further the components of the probability current are given by the 
equations j x = —c(T*T 4 + v F*T 3 +TJ^+T* Yj) ) 


jy = +ic('F* T 4 -'F* %+Y* %-Wt Ti) 


( 60 ) 


j z = -c(T* T 4 +'F* %) ) 


provided that the *F\s are normalized according to the rule 

2 f % x Yfdr^ 1 (61) 

j - i J 

to ensure that the total charge shall be — c. 

Witli these definitions of p, j it may readily be verified that the 
continuity equation « 

^4-divj = 0 (62) 

is satisfied. 

An interesting interpretation lias been given to these results by 
Breit.f From the first of equations (30) we have that 



and hence that 

x F*aq T* = l F* %+'W* %+Y* l F 2 +H ‘* x Y v 


Thus we may write the right-hand side of the first of equations (60) 

as X F*(— ca x ) x F. Similarly the right-hand sides of the second and third 

equations can be written as *F*(— col 2 ) x Y, T*(—ca 3 ) l F. Introducing the 

vector „ / x 

a = (<x v ql 2 ,(x 3 ) 

we may therefore write equations (60) in the vectorial form 

j = T*(-ca)Y. (63) 


This equation suggests that in the Dirac theory the vector operator 
—cot represents, in some sense, the velocity of the electron. For instance, 
when we set up the wave equation for the relativistic two-body problem 
(p. 336), we replace the scalar product (v 1 . v 2 ) occurring in the classical 
Hamiltonian by c^od.a 2 ); though this substitution is suggested by the 
analysis given above, its justification lies in the fact that it leads to 
results which are in agreement with experiment. 

t G. Breit, Proc. Nat. Acad. Sci. 14 , 553 (1928) 
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The Dirac theory differs from the Schrodinger theory in that in the 
latter there is no operator for the velocity, only for the momentum. 


54. States of negative energy in Dirac’s theory 
54.1. Solution of the Dirac equations for a free particle 
For an electron moving in free space both the vector potential A 
and the scalar potential <f> are taken as zero in the equations (36). 
These equations then reduce to the form 



The functions 

% = <4 ex P;-(P-r— Wt) (k — 1,2,3,4), (64) 


which represent an electron moving in free space with momentum 
P = (PvP 2 *Pz) an( l energy W, are special solutions of the above 
differential equations if the relations 

( W+mc i )C l +cp 3 Cs+cipt-ipJCi = 0 ' 

(W+mc 2 )C 2 —cpzCi+cfa+ip^Cs = 0 

—mc 2 )C , 3+cp 3 C l +c(p 1 -^ 2 )C4 = 0 1 

(W-mc^Ct-cpzCi+cfa+ipJC\ = 0 , 

are satisfied. In the equations (65), p v p 2 , p a , and W are real and the 
constants C k may be complex. Eliminating the quantities G k from these 
equations, we obtain as a condition for the existence of non-vanishing 
solutions of type (64) the determinantal equation 


IF+mc 2 0 cp a c(Pi-ipi) 

0 IF+mc 2 c^+yjg) —cp 3 

cp a c (Pi—iPt) W—mc 2 0 

c(p k +ip 2 ) cp a 0 IF—me* 

which is easily seen to reduce to the form 

{TF 2 —wi 2 c 4 —c 2 |p | 2 } 2 = 0. 
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Regarded as an equation for the determination of W this equation has 
the two double roots 

W + = +c(m 2 c 2 + p 2 )*, W~ — — c(m 2 c 2 +P 2 )*. (66) 

Only the positive root W + would have any physical significance in the 
classical theory; in quantum mechanics, as we shall see below, both 
solutions must be considered. 

Since both of the roots are double we may choose two of the 
constants C k arbitrarily and determine the remaining two from any two 
equations of the set (65). Suppose we take W to be the root W+ and 
choose Cg and C± to be arbitrary complex constants A and R, say, then 
from the first and second equations of the set (65) we have the solution 

r - -Pa A + (P i- i Pi) B n _ JPi+*P2) A —P » b 

1 JF++mc 2 ’ 2 IF+-fmc 2 

C 3 = A, C i = B 

The number of electrons per unit volume described by this solution is 

= 2 WHAA*+BB*) 

W^+mc 2 

Similarly if we take W to be W~, C\ = (7, C 2 ~ D, we obtain from 
the third and fourth equations of the set (65) the solution 

C\ = C, C 2 = D 

r _ r VzC+{Pi-iP‘t) I) r _ APi+iPi)0-p a D ( 68 ) 

( 3 _ -c - c — w _— j 

which corresponds to 

w* ^ 2 W-( CC*+DD*) 

W~-mc 2 

electrons per unit volume. 

In each case the four scalar wave-functions % are obtained from 
equation (64). 

The existence of solutions of the wave equation for negative energies 
gave rise to much speculation on their proper interpretation, since states 
of negative kinetic energy have not been observed. If they were, the 
acceleration of the electron would be in the opposite direction to that 
of the external force producing it. In the classical relativistic theory 
this difficulty does not arise, since, though the energy of a free particle 
in terms of its momentum is given by either of the roots (66), the energy 
of an electron can only change continuously; thus an electron which 
initially has positive energy can never reach a state of negative energy, 
owing to the gap between ±mc 2 . In the quantum theory, on the other 
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hand, discontinuous energy transitions are common in applications to 
practical problems. The possibility of transitions from a positive to a 
negative energy has been investigated by Kleinf and in more detail 
by Sauter,$ who found that transition probabilities between states 
of positive and negative energy are small for the fields occurring in 
ordinary circumstances. Nevertheless, all theoretical attempts com¬ 
pletely to eliminate such transitions have been unsuccessful. 

The physical interpretation of the states of negative energy is due to 
Dirac,§ who formulated the ‘hole theory’ to avoid the difficulty of an 
electron making a transition from a positive energy state into one of 
negative energy. The first basic assumption of the Dirac hole theory 
is that, in the physical world as we observe it, almost all of the states 
of negative energy are ‘occupied’. In accordance with the Pauli ex¬ 
clusion principle it is assumed that each state is occupied by only one 
electron. The second assumption is that when all the negative energy 
states are* occupied the distribution of negative energy electrons does 
not produce an external field and is consequently not observed. The 
distribution in which all the negative energy states are occupied and 
all the positive energy states are empty may be taken as a zero point 
for the measurement of electric charge, momentum, and energy. 

If we assume that an external field can remove one of the electrons 


with negative energy W = —\W\ and momentum p leaving a ‘hole’ 
in the distribution of negative energy electrons, then, by the second 
of our two assumptions, w^e shall observe the hole as a particle with 
electronic rest-mass and with its other physical properties given by the 


where e denotes the electronic charge. A hole in the distribution of 
negative energy states would thus appear as a particle with positive 
kinetic energy, which is more in agreement with our concepts of the 
nature of fundamental particles. Furthermore, the theory predicts 
that such a particle, though possessing the same mass and numerical 
charge as the electron, will be positively charged. The existence of 
such particles was confirmed shortly afterwards by C. D. Anderson,|| 
who in the course of some observations of tracks produced in a Wilson 
chamber by cosmic-ray particles in the presence of a strong magnetic 
field obtained the photograph of a track which could only have been 


t O. Klein, Zeits.f. Phyeik , 53, 157 (1929). 

} F. Sauter, ibid. 69 , 742 (1931); 73 , 547 (1931). 

§ P. A. M. Dirac, Proc. Roy. Soc. 126 , 360 (1930); Quantum Mechanics (1947), chap. xii. 
|| C. D. Anderson, Science, 76 , 238 (1932); 77 , 432 (1933); Phys. Rev. 43 , 491 (1933). 
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produced by a positively charged particle with the same mass and 
numerical charge as the electron. Subsequent experimental workf has 
confirmed Anderson’s observations of positrons, as the new particles are 
called. 

Dirac’s hypothesis that the positrons should be identified with holes 
in the distribution of negative energy states is in agreement with all the 
experimental facts now at our disposal. In the theory of the positron 
we have, however, to consider an infinite number of electrons occupying 
the states of negative energy ranging continuously from —me 2 to —oo. 
Various attempts! have been made to resolve the mathematical diffi¬ 
culty of handling this infinity, but none of them can be regarded as 
completely satisfactory, though they have all met with some success 
in certain directions. The theory of holes in its present form, though 
not yet completely satisfactory, appears nevertheless to be correct in 
general outline. 

54.2. The production and annihilation of positrons 

The experimental discovery of the existence of positrons in the 
secondary cosmic rays has led to a search for other sources of positrons. 
Shortly after Anderson’s discovery of the existence of the positron it 
was found that when certain substances were bombarded by high-speed 
ions positrons were produced. For example, by bombarding aluminium 
with a-particles Curie and Joliot§ produced a radioactive isotope 
of phosphorus which then disintegrated into a stable isotope of silicon 
with the emission of a positron. Perhaps the most spectacular discovery 
was that positrons can be produced by the passage of hard y-rays 
through matter. This process can be interpreted on the hole theory in 
a simple manner. If the field of the y-ray causes an electron in a state 
of negative energy (— |JF|,p) to jump to a state of positive energy 
(W\ p')> then the transition results in the creation of a pair consisting 
of an electron of energy W = W' and momentum p' and a positron 
of energy W + = \ W\ and momentum —p. This phenomenon is called 
pair production . If the energy of the y-ray is hv, then, since for the 
creation of a pair an energy equal to 2mc 2 is necessary to give the 
particles the correct rest energy, it follows that for the process to occur 
we must have fo, > 2 me 2 . 

t L. Meitner and Philipp, Naturwiss. 21, 286 (1933); J. Chadwick, P. M. S. Blackett, 
and G. P. 8. Oochialini, Proc. Roy. Soc. A, 144, 235 (1934); Nature , 131, 473 (1933); 
I. Curie and F. Joliot, Comptes Rendua , 196, 1581 (1933); J . de Physique , 4, 494 (1933). 

t P. A. M. Dirac, Proc. Camb. Phil. Soc. 30, 150 (1934); W. Heisenberg, Zeits. f. 
Phy&ik, 90, 209 (1934); V. Weisskopf, Kgl. Dansk. Vid. Selak. 14, 6 (1934). 

§ I. Curie and F. Joliot, Comptea Rendus, 198, 254, 408, 559 (1934). 
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Substituting the known values of the electronic mass and the constants 
h , c, we find that for pair production to occur the y-ray must have 
energy in excess of 1-02 Mev. and therefore a wave-length less than 
0*012 A. The excess energy hv—2mc 2 appears as the kinetic energy of 
the positron and the electron. The cross-section for the creation of a 
positive electron with energy W + and a negative one with energy TFL 
has been calculated by Bethe and Heitler.f In the case where all the 
energies are very much greater than the rest energy me 2 of the electron, 
but not so great that the screening of the Coulomb field by the outer 
electrons is appreciable, the cross-section for the creation of a positron 
of energy between W + and W++dW + and an electron of energy W_ 
(= hv— 2mc 2 — lF f ) is found to be 


, jfi/ 4 2.awt W\ + Wl+$W + W_L 2W+W„ 1\ /<Mn 

-sj' <69) 

where $ is defined in terms of the nuclear charge Z by the equation 




ZW 
137m 2 c 2 


For the case of complete screening and high energies the corresponding 
formula is 


dW + 


4 $dW + 


W ^ + W * ±W+W- log 188 




0bvf 


Z § 9 {hvf 




(70) 


The total number of pairs created in a process of this type can be 
determined in the cases of negligible and complete screening by inte¬ 
grating equations (69) and (70) over all possible values of W+. In this 
way we obtain for the cross-sections for pair formation 




Corresponding results can be obtained J by numerical integration in the 
case of energies for which the screening is partial. The theory of pair 
formation developed in this way has been compared with experimental 
data§ and found to be in good agreement with the observed results. 

Pairs can be formed in several other ways. They can be formed 
during the passage through matter of a proton or an a-particle whose 


+ H. A. Beth© and W. Heitler, Proc . Roy, Soc. A, 146, 83 (1934); H. A. Bethe, Proc, 
Camb. Phil. Soc. 30, 524 (1934). 

J Loc. cit. and H. R. Hulme and J. C. Jaegar, Proc . Roy. Soc. A, 153, 443 (1936). 

§ W. Heitler, Quantum Theory of Radiation , pp. 201-2 (Oxford, 1944). 
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kinetic energy exceeds twice the rest energy of an electron,! or by 
the motion of an electron through the field of a nucleus.! Furthermore, 
an electron pair can be created in the field of a nucleus by a y-ray 
emitted by the nucleus itself,§ or by the action of two light quanta 
whose total energy exceeds 2me 2 . Finally, if an electron of energy 
greater than 7mc 2 collides directly with an electron at rest an electron 
pair can be created.|| 

Since a pair consisting of a positron and an electron can be created 
we would expect that they can also be destroyed. In the hole theory 
this would correspond to the transition of an electron from a state of 
positive energy to a hole in the distribution of electrons in negative 
energy states. The energy of the positron and the electron is given 
out in the form of radiation. The most common process of annihilation 
is that in which a positron combines with a free or lightly bound electron 
and their energy is taken away in the form of two photons. If the 
momenta of the colliding particles are equal and opposite the photons 
have equal frequencies, but if the electron is at rest the two photons 
have different frequencies. Since the energies of the colliding particles 
must exceed 2mc 2 , a simple calculation shows that the wave-length 
of the annihilation radiation must in the former case be less than 
0*024 A. The cross-section for such a process in which a positron of 
energy W+ collides with an electron at rest has been calculated by 
Dirac,|f who derives the formula 


♦= m 

where y denotes the ratio W+/mc 2 . For very high energies this equation 
reduces to the simple form 

* - (’ 2 ) 


This theory of two-quanta annihilation has been verified experimentally 
by Klemperer.!! 

An alternative process is the combination of a positron and an 
electron near a nucleus with the emission of a single photon which acts 
as a carrier for the total energy of the two particles. The case which 
can be calculated most easily is that in which the electron is bound 


t W. Heitler and L. Nordheim, Joum. d. Phya. 5, 449 (1934). 
J For references see W. Heitler, op. cit., p. 203. 

§ W. Heitler, op. cit., p. 204. 

|| F. Perrin, Comptea Rendua, 197, 1100, 1302 (1934). 
ft P. A. M. Dirac, Proc . Camb. Phil. Soc. 26, 361 (1930). 
tt O. Klemperer, ibid. 30, 347 (1934). 
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in the K -shell of an atom; the formula for the cross-section of this 
process corresponding to equation (71) is 


<t>k = 


4 t re*Z 5 1 f 2 2 4 y+ 2 

1 MhnW (y+l) 2 (y*-l)i[ Y + 3 r + 3 (y 2 -!)* 


log{y+(y 2 -l) 1 } , 


(73) 


where as before the energy of the positron is W + = yrnc 2 . For large 
energies this reduces to 


, 47re 4 Z 5 [\ 1 , " 

~ 137 4 m 2 c 2 y[ “y 2 ° g 7 \ ‘ 


A comparison of the formulae (72) and (74) shows that the cross- 
section for the annihilation of one photon is always smaller than that 
for the annihilation of two. The maximum value of the ratio 
occurs when y is approximately equal to 10; in lead this gives 

tktt - 0 - 20 . 


54.3. Relativistic theory of the Compton effect 

One of the most striking proofs of the fact that the states of negative 
energy arising from Dirac’s equation must be considered as well as 
those with positive energy is provided by the relativistic theory of the 
Compton effect-t The radiation process considered in the Compton 
effect is the collision of a quantum of light of momentum k 0 with a 
free electron of rest-mass m which is initially at rest. In calculating 
the transition probability from this initial state to a final state in which 
the electron has a momentum p and the light quantum a momentum k 
we must consider intermediate states in which the energy is not con¬ 
served though the momentum is. It is found that the only possible 
processes are the following: the electron absorbs the quantum k 0 and 
acquires the momentum p' = k 0 and then emits a quantum k in the 
transition to the final state; alternatively, the electron first emits a 
quantum k changing its momentum to p" = — k and then absorbs the 
quantum k 0 in the transition to the final state. 

Now to each of the values p of the momentum of the electron there 
are altogether four states corresponding to the two directions of spin 
and the positive and negative values of the energy. If we take the 
negative energy states as intermediate states as well as the positive 
ones we obtain for the differential cross-section the Klein-Nishina 

formula} mk k 1 

** = ^^|[| o +~-2+4cos 2 0j ) (76) 

t O. Klein and V. Nisliina, Zeits. f. Physik, 52, 853, 869 (1929). 

X W. Heitler, Quantum Theory of Radiation , 2nd ed., § 16 (Oxford, 1944). 
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where 0 is the angle between the directions of polarization of k and 
k 0 and r Q denotes the classical electronic radius e 2 /rac. If 6 denotes the 
angle between k 0 and k, the relativistic relation between energy and 
momentum and the conservation of energy and momentum lead to 
the equation 2 

^ ~~~ rac 2 +& 0 c( 1—cos 6) ^°* 

If the primary radiation is unpolarized and of intensity I 0 the 
intensity of the scattered radiation at a distance R from the scattering 
electron is then readily shown to be I(9)dil, where 1(6) is given by the 
formula 

m _ ij (ro\ 2 _ 1_f l4 _ _y^l-cosfl) 2 _ 

^ } 2 0 \r) {l+y(J~cos0j} 3 i ‘ (1 +cos 2 0){l +y(l —cos0)} 

(77) 

and y = k 0 jmc. The intensity distribution for a radiation of wave¬ 
length 0*14 A (y = 0*173) in carbon has been measured by Friedrich 
and C5oldhaber,f and it is found that the curve for 7(0)// o derived from 
these measurements is, within the limits of the experimental error, in 
exact agreement with that determined from the above equation. 

The total scattering may be obtained from equation (77), by inte¬ 
grating over 0, and the results compared with the experimental values. 
It is found that the theoretical curve so determined agrees well with 
the experimental values at least for energies up to 10mc 2 4 On the 
other hand, other relativistic wave equations, such as the Klein-Gordon 
equation, lead to results which are not in agreement with the experi¬ 
mental measurements. The Klein-Nishina formula may be regarded as 
proved, at least for energies in the range considered, and its exactness 
considered as direct evidence in favour of the Dirac theory of the 
electron, when the negative energy states are treated with the same 
weight as those of positive energy. For, if we entirely disregard the 
intermediate states of negative energy, the formula we obtain for the 
Compton scattering is appreciably different from the Klein-Nishina 
formula and is no longer in agreement with the experimental evidence. 

55. The relationship of Dirac’s equations to previous theories 

The fact that in practical problems two of the scalar wave-functions 
(Jc — 1,2,3,4) have usually greater magnitude than the remaining 
pair enables us to develop the Dirac wave equations in an approximate 

t W. Friedrich and G. Goldhaber, Zeita.f, Physik, 44, 700 (1927). 

% W. Heitler, loo. cit., p. 160. 
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manner which brings out their relation to the non-relativistic theories 
of Schrodinger, Pauli, and Darwin considered in previous chapters. 

If we wish to find a periodic solution 

% = 

of the scalar wave equations (36), then we have to solve the system of 
equations 

(W~ 4+^ c2 )x 1 + c (n 1 -in 2 )y 4 +cn 3 x 3 = o\ 
(Pf_^ +mc 2 ) X2 4. C (n 1 +in 2 ) X 3-cn 3 x 4 - o I 
(Jf-~4-.mc 2 )x 3 +c(lI 1 -iIl 2 )x2+cII 3 Xi = o j 
(W ~^~mc 2 )x4+c(II 1 +/n 2 )xi"-cn 3 x2 = o / 

Now in the case of positive energy states, if the velocity of the electron 
is small in comparison with the velocity of light, W is approximately 
equal to the rest energy me 2 of the electron; therefore 

~(IP— eA+mc 2 ) 2 me. 

c 


Substituting from this equation into the first two equations of the above 
set we obtain immediately in the case A — 0, 



If we substitute these values of xi and X 2 into the third equation of 
the set (78), we find that xz satisfies the equation 

V a Xs+^(W r -mc a .-e^)x3 = 0. (80) 


Now if W is nearly equal to me 2 , the quantity W—mc 2 represents the 
kinetic energy W' so that xz is a solution of the non-relativistic 
Schrodinger equation 

(v 2 +~(W'~V)Jx = 0. (81) 

The same result holds for xa • It is clear from equations (79) that |xi( 
and |xa I are small compared with | xsl and |x 4 |; thus, in a first approxi¬ 
mation, they can be neglected in the expressions (59), (60) for the 
charge and current density. We then have approximately 
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These results provide an approximate solution of the Dirac equations 
(36); if Y is a solution of the non-relativistic Schrodinger equation (81), 
then the following are approximate solutions of (36): 

%=-C l Y, Y 4 ^-0 2 Y, 

xp __ c*(ni —i 112)+n3___ QWi+ilhh 0 * n _ 3 T 

1 2 me ’ 2 2 me 

where C v C 2 are arbitrary constants and U v IT 2 , II 3 are interpreted in 
the usual way as operators. 

For negative energy states the roles of Y 3 , Y 4 and Y 3 , Y 2 are reversed. 

To show more fully the relation of the Dirac equations to the non- 
relativistic theory of Pauli and Darwin, we introduce the two wave- 
functionsf 


Xi 





y z=z 


Xa 


(82) 


The Dirac wave-function T can then be written in the form 




% 

% 


where A denotes the matrix 

A = = 


= (Xl~ A X2) e 


0 

0 

0 

-1 


imc^ljh 


-1 

0 

0 

0 


(83) 


(84) 


Differentiating both sides of equation (83) with respect to the time, 
we obtain - 

0T I or- A — \ 1 -7 (CXl 

dt ' 


ill 


mc 2 (xi-Ax 2 )+^ 


dt { 

Now Y satisfies the Dirac equation 

d 




dt)\ 


(85) 


i 


and since 


and 


M f t + e< l > + c .2 a i H i + «4 rnc2 


= 0 , 


( 86 ) 




f We consider here the case of positive energy states only; the analysis is similar for 
negative energy states, cf. E. L. Hill and R. Landshoff, Rev. Mod. Rhys. 10, 120 (1938). 
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we may write oq xi = — Aa x 

or, in general, «*Xi = ~^kXv “ 4 X 1 = Xi> ( 87 ) 

where k = 1, 2, 3. Similarly 

«fcAx 2 = — a k x 2 , «4 Ax2=— A Xa- ( 88 ) 

Substituting from equations (85), (87), and (88) into the Dirac equation 
(86) we obtain 

( c a j Xa+»* + (2mc 2 +^)xi| + 

+A !!_, aj Xi+M ~ = <>• 

Since the operator A and the unit operator are independent, each of 
the expressions in brackets must vanish. Tn this way we obtain the 
equations , ^ v 

[ill ^ + 2 mc 2 +exfj fa +r(a. U)fa = 0, (89) 

!*&■£ + e <tj Xz+ C ( a • n)xi = 0, (90) 

where we have written 

(o.n)= i aj n,.. 

J = 1 

Now we can write equation (89) symbolically as 

- = _ (o.n)x, _ 

Xl 2mc[l +{in(dldt)+e4}l2mcr) ’ 
showing that jf, is given approximately by the equation 

* “ -i (c ' n) ^+4Jv(^4 + ^) (0 - n)xV (91) 

Substituting from equation (91) into equation (90) we obtain 


h dt 

where § = — e</>+-^(a.II) 2 — 

Milt 

Now from equation (34) 


= $X 2 > 
4m 2 c 2 ’ 



(o.H) a = n 2 -f-— (o.curlA) = n a +~(e.H), 
c c 


( 92 ) 
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since by Maxwell’s equations H = curl A. Furthermore, 

and 

(®.n)(«.E) = (E. H) -J-v div E—i(c. E A H) +&(c. curl E) 

% 


(E. H)— ?'(a . E a H) - 


b( dU\ 

~C\ dt)’ 


since 


divE = 0, and curlE — —- —. 

c dt 


Substituting these results into equation (92) we obtain 


ib 


4m 2 c 2 


[(E.H)—*(<j.E A H)—*(<,.—)]. 

To this degree of approximation we may write (o. H) 2 /2m for 
so that we have finally for the Hamiltonian § the equation 


* = S,+&+Jfj(»-H)- 


be 


4m 2 c 2 


(cj.EaH)- 


ib 

±m 2 c 2 



where denotes the Hamiltonian operator of the non-relativistic 
Sehrodinger equation, viz. 





and be(a .H)/2mc is the energy produced by the spin of the electron 
in an external field. The term 




8 m 3 c 2 


[n*+|(..H)]’ 


is a relativistic correction term. The fourth term 


be 

4 m 2 c 2 


(o.EaH) 


is the term attributed by Thomasf to the spin orbit coupling. The 
remaining terms have no counterpart in the non-relativistic theory. 


fi072 


f L. H. Thomas, Phil. Mag. 3, 1 (1927). 
Y 
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56. Solutions of Dirac’s equation 
56.1- Rectilinear motion of an electron 

We now consider the motion of an electron parallel to the o^-axis in 
a field that is a function of x only. We suppose that the vector potential 
A is zero everywhere and that e<f) — —V(x). The wave function may 
be written in the form 

Y = x(a0exp^(3J 2 y+4> 8 z— Wt). (93) 

Substituting this expression into the Dirac equation (9) we obtain 

+ « 2 p 2 + a 3 p 3 +« 4 wc ] Xfr) = °- 

Written in terms of four scalar functions Xfi( x ) (/*=■= 1,2,3,4), this 
equation becomes 

~ Xi +ip 2 Xi-n Xi~ mc Xi = 

+ W ~J {X) Xs-ip*X>+PsXi-™X* = 0, 

~ in ia + Xi+iPi X 3 —P 3 Xi+ mc X2 = 0, 

- ^ p x - Xi - nh Xi+ih Xz+ mc Xi = o. 

If we substitute the expressions 


Xi ~ ~F^ 2 ’ 


Xi = & + *4 = ^ 1+^2 

into the second and third equations of this set of differential equations, 
we find that they reduce to 

d<f> 2 i W—V{x) x , k x k[ n , p t +imc\ A A 

te-h-—r-^ + r i+ h( a+ —jr-r 0> 

ty i W-V(X), , k, , k (. 3) a +imc\, _ . 


• rll w rr r / I , 

dx h' c 




These equations can be written in the form 

d<f> 2 i W-V(x) k _ 
~dx~h •— 

if a, 6 are chosen according to the equations 

o = —{p 2 +imc)jk, b = ( p 2 +imc)/k. 
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With these values of a, b inserted in the expressions for xv X 4 we 
substitute for the y’s in the first and fourth equations of the original 
set. Taking account of the equation (94) we find that both equations 


reduce to 


i W-V(x) x k A 
— +r —— 


We have thus reduced the problem to that of solving the two first- 
order differential equations (94) and (95). When the functions (f> x and 
<f >2 are determined from these equations, the scalar wave functions are 
obtained from the relations 


'Fi = ^ 2 (rr)| exp p., y+p a s— W /■), 

% = -j~ < f>2( x ) ex Y , jAP2 y+p^—wt), 

^3 = -^faWexVj&zy+Pzz-wt), 

% == |<^ 1 (x)+^~-> 2 (*)jexp~(p 2 y+jp 3 z-JF<)- 

The case of a uniform field V(x) = ax has been treated by Sauter,f 
that of a simple harmonic oscillator, V(x) = cox 2 , by Nikolsky,J and 
that in which V(x) is a polynomial of any degree in x , or in 1/a:, by 
Plesset.§ 

56.2. The energy levels of the hydrogen atom 

The most important success of the Dirac theory, apart from the 
solution of the problem of the spin of the electron, has been the 
derivation of the fine structure of the energy levels of an electron 
moving in the Coulomb field of a single nucleus. Dirac,|| using the 
methods of non-commutative algebra, showed that SommerfekTs fine 
structure formula for the energy levels of the hydrogen atom was a 
natural consequence of the use of the relativistic wave equation. The 
same result was later derived by Darwinf f and by Gordon by more 
familiar methods; we shall follow the latter treatments here. 

The Coulomb field of an electron moving in the field of a nucleus 
of atomic number Z may be derived from the potentials 

A = 0, <f> — Ze 2 jr . 

f F. Sauter, Zeits.f. Physik , 69, 742 (1931). 

J K. Nikolsky, ibid. 62, 677 (1930). 

§ M. S. Plesset, Phys . Rev. (2) 41, 278 (1932). 

|| P. A. M. Dirac, Proc. Roy . Soc . A, 117, 610 (1928). 

tt C. G. Darwin, ibid. 118, 654 (1928); W. Gordon, Zeits.f. Physik , 48, 11 (1928). 
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We substitute from these equations into equations (36); introducing 
polar coordinates r, 8 , (f> defined by the equations 

x = rsin0cos<£, y = rsin0sin<£, z = rcos0, 
we obtain 


d . d 

. -4— % —— : 

8x r dy 


e^|sin 0~4 

a . 3 .,r. „ 

- i — = e~ l< ? I sm 6 

'&x dy L 


cos0 d 


+ i- 


r d8 rsin8d<f> 
d , cos 0 d . d 


9 


—- SS5 COS C 
& 07- 


fir ' r fifi r sin 8 c)<j> 

sin0—. 
rdd 


]• 


We thus obtain for the scalar wave functions the following equations: 

d , cos0 d i d^ 


~Ft ^ ~r~ mC ' 2 ) ' ihGe^i ^sin 0 ~ - 


(i/i ~ ~ + me 2 j T 2 —|sin 0 


d , cos0 d 


r dd r&mddf/)^ 
iftc 

\ dr 
i d 


;)T 4 - 


(cos0—— sin# n . 
1 fir rfifl 


i)^ = «• 


dr 


^ 3 + 


(iK~ -|- —— rncA 4^— ilice^ (sin 0 — - 
\ dt { r J \ dr 


8 , cos0 £ 


r 88 ‘ rsin0 d<j>) ~ 3 

4~^c(cos0^~sin0-^W 4 = 0, 
\ dr rd8) 4 ’ 

_ * ^ \ \p_ 

r S0 rsin0ety/ 2 

iTic(cos0-|- —sin0-?-W 1 = 0, 

\ dr rd0/ 1 

_ i ^ W { 

dr * r 88 1 rsin0 d</>) 1 

+7lc(cosfi|-8infi-l)T 2 = 0. 

In these equations we substitute 

4; - %(r 3 0, <f>)ex V (-iWt/h) (f* = 1,2,3,4), (96) 

and express the function 4^ in terms of F, /, G f g by equations (64). 
As a result we obtain, after some reduction, the system of differential 
equations 


t 


ih y- 4 - ~— me 2 j 4 4 —ificerf |si 


dt 


sin 0 


d , COS0 5 


dr 

dG 

dr 


+( j '+1);+e( ,f +t-’“ 1 ) 0 = 0 


( 97 ) 
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dF 

dr 


+ 


dg 

dr 


[W+^+mc* 


9 = 0 


N) 

H )'+ s (" r + T -" K, ) J '- 0 J 


(98) 


The differential equations for the four functions /, < 7 , G thus fall 
naturally into two pairs. Consequently for each permissible value of W 
we can find two independent solutions for the T-function by equating 
to zero either of the pairs of functions (/, 0) or {F,g). 

The solution of the pair of equations (98) is similar to that of a pair 
(97), so we shall consider in detail only the solutions of (97) here. 
If we make the substitutions 


£ = Ze 2 jhc, e = mc 2 /W, A = mc(l— e 2 ) 5 /#, p = 2Ar, 
f(r) = i(l- f )ic-iV-U(p)-A(p)} j 

0 (r) = (14 e^e-iPpy-y^+Mp)} V 
the pair of equations (97) is transformed to 


dfx 

dp 

djj 

dp 


f 1 p[ y + J(\- e *)j\ fl + p{ J + 2 « 2 )} /a 


( 100 ) 


If we assume the functions /j(p), hip) to be given by power series of 
the form «, 

flip) = 2 c sP a ’ hip) = 2 d aP*> 

8 = 0 8 = 0 


then substituting for the functions in equation ( 100 ) and equating 
powers of p 8 " 1 we obtain the recurrence formulae 

(101) 

4 ' + ^+ jni ?))- 4 +s -^))“ 0 i 

for the coefficients c a , Since we are considering only positive powers 
of p we may take s = 0 , c L == 0 in equation ( 101 ) to obtain the equations 


( r+ V(l-«•))"• ( J+ 2 

(^+5+7(i3?j) c »-(>'-7(CTj) ,i " - °- 
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The necessary and sufficient condition for non-vanishing solutions of 
the type (99) is therefore 

y-f- . jf- _^_ = 0. 

r+ V( 1 -« 2 ) j 2 + V(!-« 2 ) 

i + i + _y + _J1_ 

■^^2 ‘ ^(1—e 2 ) 7 

Regarded as an equation in y this determinantal equation has two 
roots, equal in magnitude but of opposite sign. Discarding the negative 
root since it would lead to wave functions which behave as r~ n (n > 1) 
in the neighbourhood of r = 0 and hence give a divergent integral for 
J | l F| 2 dr, we find that the value of y is given by the positive root 

y - {(j+i)*-**}*. ^ (102) 

Since j > 1 and £ = Z(e 2 /fic) — Zj 137 < 1, it follows that y is always 
real. 

Eliminating d 8 from equations (101) we obtain the recurrence relation 

, = , 

'* (y+^-y+iy+P''' 1 ' 

n'-l 

If the function f x (p) is to be a polynomial of the type c sP* which 

8 = 0 

the last term does not vanish, then we must have 

y-fw'— r-^—s- = 0. 

Regarded as an equation in e this equation has roots 

6 _ +_y±?__. 

± J{P+(y+n')*} 

Taking the positive root of this equation and defining the principal 
quantum number, n , by the equation 

n = n'+j+i 

we obtain IT = mc 2 |l-|-(103) 

l (n-J-b+y) 2 ) 

Subtracting the rest mass of the electron and substituting for £ and y 
we obtain Sommerfeld’s fine structure formula 

n " “ ’"'[ I 1 ll04) 

For small values of Ze 2 /Hc this formula reduces to the non-relativistic 
formula w , Z*me* 1 


y+n' — 


6 = ±. 


we obtain 


W = me 2 -1 + 


W /Mr 2 1 jl -|- 


' ] ]‘ (104) 


Z 2 me 4 1 
'~Wn 2 ' 
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The analytical forms for the functions f t (p) and Up) can be determined 
easily; it is found that when W is given by equation (103) j\ and f 2 are 
given by the formulae 

f 1 ( P ) = 1 F 1 (l-n',2y+l; P ) 
y+n'+{j+ i)e 


Up) 


n e 




(105) 


where the hypergeometric function ^ is defined by 

F( a ft- x) - V T (P) r (*+ n ) *1 
1 l( ’ P ’ )_ 2r( a )r()3+n)n!.- 

If we substitute for f v f 2 from equation (10*5) into equation (99) and 
transform back to the original variables, it can be verified easily that 

|G|>I/I- (106) 

The solution of the pair of differential equations (98) may be carried 
out in a similar way. If we write 

F = *V(l+ € ).e-V-W)-W}> 9 - V(1 

and proceed as before, we find that the condition (102) and the 
Sommerfeld formula (103) are obtained once more, and that 
y+n'— (j+£)e. 


Fi(p) = 


-Up), f 2 ( p ) = Up). 


y+w' + (j+l)e 

Corresponding to equation (106) we then have the relation 

1^1 > \g\- 


(107) 


(108) 


56.3. Motion of an electron in a uniform magnetic field f 
Consider the motion of an electron moving in a uniform magnetic 
field H in the z-direction. Then the potentials are given by 
<t> — 0, A = \Hx, 0), 

so that the Dirac equation (43) reduces to 

[Po+PiiUPx-^+UPv+^+^Pzl+Pz^'F = °> 
where we have written a> — eHf2c. If we assume that the wave function 
T is independent of z and write 

T — ^(x)exp^(o)xy+rjy— Wt), 
we find that <f> satisfies the equation 

^+/ > i{ <T *2 , ®+ <T i/( 2a,a: + , 7)}+/ , 3 mc J^ — °- 
f L. D. Huff, Phys. Rev. (2), 38, 501 (1931). 
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Substituting x' = x+7]/2co and writing out the four component equa¬ 
tions we obtain the system of ordinary differential equations 


2W ^ 4 +(7' + mc ) ^1 = °’ 

* + 2i(ox'<f> 3 + (~r + mc )^ = °> 


i dx‘ 


rlr' _w * i+ (—>«)*.=°- 


% < 2 ^ 
i da; 


( pf 

——mc 


cj^ 4 


0 . 


Eliminating < 7 ^ between the first and last of this set of equations and 
writing 


f - 2x'(co/h)* 9 


1 (W 2 
4cjft \ c 2 


-m*c 


2^2 


we obtain the differential equation for a harmonic oscillator 






(109) 


whose solution may be written (cf. Appendix, § 2) 

h = ai>„(£) = ae-^UM), 

where //„(£) is the Hermite polynomial of order v. 

Similarly if we eliminate <f> A between the first and last equations of 
the set we obtain <f> x = bD v „ x ((;). The two solutions are not, however, 
independent. Substituting the two expressions for <^» l5 </> 4 into the first 
equation of the set we obtain a relation between a, b. In this way we 

obtain Wlc-mc 

U*) = aDM M*') = 

as the expressions for the functions ^ and c/> 4 . 

It can be shown in a similar manner that 

&(*') = *0, 

where C is an arbitrary constant. 

The current densities in the field are given by 

j x - = -(l+cV)(^4+<^i). 

jy - Y = 

W/c—mc 


= ( 1 + c t p a )aa*D t , D v _ x - 


(M* 


where 


p 2 = W'lct-mW. 
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From the properties of the D-functions it follows that j y is negative 
for some values of x; this is apparently a consequence of the spin, since 
the corresponding expression in the Schrodinger theory turns out to 
be always positive.! 

Using the results (cf. Appendix, § 3) 

00 00 

J Am-1(£)« = { T(PlVy » / 

0 0 

we find that the average ^-coordinate of the current defined by 

? /? 

* = J xjydx J jydx 

0 0 

has the value x = 1 ^ KJ -I 'I . 2v(2tt)K (110) 

4 \to/ v! 

Now the classical value of this quantity is 

X = 777* /4, 

where r = cp/eH. The quantum mechanical value reduces to this if we 
use Stirling’s formula 

n! — (27m)* 

The error in the Stirling formula is of the order 1/y, i.e. 1()~ 8 to 10~ 10 , 
so that the deviation of the classical formula from that given by the 
Dirac theory is entirely negligible. 



56.4. The scattering of fast electrons by a nucleus 
If a beam of electrons with velocity comparable to that of light falls 
on a single atomic nucleus, Dirac’s equation of the electron must be 
used to calculate the scattering. We have in fact to find a solution 
V F = (T 1? \P 2 , of this equation which represents an incident plane 

wave and a scattered wave, and thus which has the asymptotic form 

piprjfi 

'¥ k ~C k eM*+u k (8,<f>)- - (k = 1,2,3,4) (111) 

r 


for large values of r. The constants G k are not all arbitrary constants; 
it follows from equation (67) that if we take 

<7 3 = A, C 4 = B , 

where A, B are arbitrary complex quantities, then 


_Ap_ c _ Bp 
p+mc’ 2 p+mc’ 


(112) 


t G. E. Uhlenbeck and L. A. Young, Phys. Rev. 36, 1721 (1930). 
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where we have written p for W/C. We shall choose A, B such that 
AA*-\-BB* — 1. As already noted the scattered current depends on 
m 3 (0,<£) and v A (0 3 (f>) only, so that, if I(8)nt sin 9d8d<f> is the proportion 
of the original beam scattered in a given solid angle, we have 

/(*)= l*i(M) IM-KMI 2 - 

Furthermore, if we take the direction of the spin to mean the direction 
referred to axes with respect to which the electron is at rest, then 

-|-<*•**** ( iis ) 


where oj are the spherical polar angles of the spin direction. 

Now it can be shown that the general solution of the form (111) is 

given by U 3 (d, 4>) = Af(9)-Bg(d)e-i* j 

« 4 (M) = Bm+Ag(0)e<+ Y 


(114) 


where f(6) and g(6) are functions of 8 alone and depend on the form 
of the potential function V(r) describing the interaction between the 
electron and the nucleus. Thus 


1 ( 8 ) = 

+ (115) 

so that if A, B satisfy the equations (112) and (113) we have 

1(6) = |/(0)| 2 +l^)| 2 +^)sin X sin(a>-^), (116) 


where the function D(8) is defined by the relation 

m = i{f(8)g*(0)—f*(O)g(9)} (117) 

and w determine the direction of the spin axis of the incident 
electrons. 

If the incident beam is unpolarized so that the spin axes are pointing 
in all directions, we average over all possible directions of the spin axis 
and obtain 




(118) 


In the general case the function P will depend on the polarization of 
the incident beam, and even if the incident beam is unpolarized the 
scattered beam will in general possess a polarization. 

The functions f(9) and g(8) for scattering by an atomic nucleus with 
inverse square law field 

V(r) = Ze 2 /r 

may be determined*)* from Gordon’s solution of the Dirac equation for 


t Cf. N. F. Mott, Proc. Roy . Soc. A, 124, 425 (1929). 
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the hydrogen atom (§ 56.2 above) and from the asymptotic expansion 

m 




r(js-«) 




m 

r(«) 


e*z“-P 


2at) > 0) 
—2a) < 0) 


and |arg(— z)\ < 77 , |argz| < rr. For the details of the calculation the 
reader is referred to the original paper; we shall only give the result 
here. The functions f(6) and g(6) are such that 

m= \m\ 2 +\ g m* 


Z 2 c 2 rf, 

- 4^2 


^(1 — )3 2 )(coscc 4 |0—j8 a cosoe 8 4S+wa/3-\—i-rff+ (^(« a )j (1 19) 
{ “ sur W ) 


and 


Z 2 r 2 


fg*—f*g = (1 cosec 0 log( cosec | 0 ), 


( 120 ) 


where to, e denote the mass, velocity, and charge of the electron, Z the 
atomic number of the scattering nucleus, and a = Ze 2 /fic, j 8 = v/c, and 
r 0 = e 2 /mc 2 . 

The formulae (119) and ( 120 ) determining the total scattering and 
the polarization of the scattered beam are calculated with neglect of 
radiative forces which are appreciable for fast electrons. 

If we let f}->0 in equation (119) we obtain the non-relativistic 
formula , 7 % \ 2 

m 


I Ze 2 \ 2 ... 

w) oW1 *- 


Now the number of particles N scattered in the angular range 
(O v 0 2 )lS 2v g, 

nt J d<f> j I(6)smd dd, 

0 e x 

so that substituting for 1(6) from (119) we obtain the formula 
* - 

-f 2 mx/?(sin J^—sin | 0 2 +cosec cosec J 0 2 )-f 0(cx 2 )J, ( 121 ) 

which reduces to the well-known result 

\2 


N 


\mv 2 J 


(cot 2 \6 X — cot 2 %0 2 ) 


in the non-relativistic approximation /? 0. 
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For large values of Z and small angles, 0, we have approximately 

m = (S) 2(i -^ )cosec4 ^- 

For large values of Z and large angles there is no simple formula for 
the scattering cross-section. The scattering has, however, been calcu¬ 
lated for gold at a scattering angle of 90° and for various velocities. 
We can express the scattering at 90° in the two forms 




(122) 

or 

m 

(123) 


where R and S are numerical factors which are functions of v/c. The 
values of R, 8 are given in tabular form below: 


Table 9 


P 

0 2 

03 

0 4 

0 5 

0-6 

07 

0-8 

0-9 

10 

R 

M 

1-2 

1-4 

10 

1-9 

2-2 

2-6 

30 

3-4 

8 

1-0 

M 

11 

1*2 

1-2 

11 

0*9 

0*5 

0 


There have been a number of experimental investigations of scattering 
of fast electrons by nuclei, mostly by the method of the expansion 
chamber. A review of the position in 1938 was given by Champion.f 
For more recent work, see RandelsJ and co-workers. Scattering has 
been investigated in H, N, F, A, Kr, I, Xe, and Hg, for energies usually 
up to about 3 Mev., though for xenon recent investigations have gone up 
to 12 Mev. Except for some early work on mercury, discrepancies with 
theory are seldom as great as 2, and may be due to experimental error. 
However, there is little work on large angle scattering with heavy 
elements. 

56.5. The polarization of electrons by double scattering 
In § 53.4 we saw that formally the electron has a magnetic moment 
of magnitude eK/2mc. When the electron is bound in an atom this 
magnetic moment cannot be observed directly; all that can be observed 
is the magnetic moment of the atom as a whole, or the splitting of the 
spectral lines which we may attribute to the magnetic moment of the 
electron. It is natural then to inquire if a free electron possesses a 
magnetic moment and if there is any experiment by which we may 

t F. C. Champion, Rep. Prog. Phys. 5, 366 (1938). 

% R. G. RandeUi, K. T. Chao, and H. R. Crane, Phys. Rev. 68, 64 (1946). 
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determine this moment. It can readily be seen that a magnetic moment 
eh/2me can never be observed directly, for instance, by the use of a 
magnetometer.f Suppose, for example, that the free electron whose 
spin is to be observed is at a distance r from a magnetometer. Then 
the magnetic field due to the spin moment is of order 

eh/mcr 3 

and the magnetic field due to the motion of the electron is of order 

ev 1 
c r 2 ' 


If the effect of the spin is to be observable it must be greater than 
the uncertainty in the field due to the electronic motion, so that we 
must have 


h 


mr 


> At’, or 


h 

rnAv 


>r } 


(124) 


where Av is the uncertainty in our knowledge of v. Now by the un¬ 
certainty principle we cannot observe both r and v at the same time; 
the uncertainties of our knowledge of r and v are related by the equation 

ArAv > h/m . (125) 

Substituting from this equation into equation (124) we find 

A r > r, 

showing that such an experiment will be impossible, since the un¬ 
certainty in the position of the electron under observation would exceed 
the distance of the electron from the magnetometer. In other words, 
the uncertainty in the field we want to measure would have to be 
greater than the field itself. The only way in which we can hope to 
detect the magnetic moment of a free electron is by obtaining a polarized 
beam, that is, a beam in which all the spin axes are pointing in the same 
direction or at any rate more in one direction than another. A Stern- 
Gerlach experiment would appear to be the obvious way of producing 
such a beam of electrons, but here again it can be shown by means of 
the uncertainty principle that this is impossible. It appears certain 
that the magnetic moment of a free electron can never be observed 
in any experiment based on the classical conception of an electron 
magnet. 

The idea that an unpolarized beam of electrons (i.e. one in which the 
spin axes are pointing in all directions at random) scattered from a 

t The argument is due to N. Bohr; cf. N. F. Mott and H. S. W. Massey, Theory of 
Atomic Collision p. 42 (Oxford, 1933). 
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target might be partly polarized was first put forward by Mott.| This 
polarization could be measured by allowing the scattered beam to fall 
on a second target. By projecting an unpolarized beam of electrons 
on to a foil thin enough to ensure single scattering, and allowing some 
of the scattered electrons to fall on a second thin foil, we should be 

L 


M 

Fig. 65. Experiment for producing and detecting electronic polarization. 

able to detect an asymmetrical scattering about the direction in which 
the beam falls on the second target. The theory of such an experiment 
was developed by Mott,f making use of the Dirac wave equations for 
the electron. 

Suppose the initial beam LT X (cf. Fig. 65) is represented by 

% = 4 ^, % = Be****, 

then the direction of the spin axis of the scattered beam is determined, 
according to equation (113), by the ratio of the amplitudes of the two 
components of the wave function of the scattered beam which, according 
to equation (114), are 

Am-Bg{d\), . Artej+Bfflj, 

where 8 1 is the angle between the incident electron beam and the 
scattered beam. Rotating our axes through an angle 0 X so that T X T 2 
becomes the axis of z we find the beam of electrons is represented by 

^3 == [W(^)-^(^)}co S ie i +{Ag(e i )+Bf(8 1 )}am tfj****, 

% = ^(6,)+Bf(e^cos ^-{AfidJ-BgidJjsin tfje***. 

By making use of equation (115) and averaging over all directions of 
the spin axis of the initial beam LT V we can obtain the number of 
electrons scattered by the second target in a given direction (0 2 , <f> 2 ). 
We are interested primarily in the asymmetry in the scattering about 
the line T t T 2 . For prescribed values of 0, and 0 2 a straightforward 

f N. F. Mott, Proc. Roy. Soc. A, 124, 425 (1929); 135, 429 (1932). 
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calculation shows that the number of electrons scattered per unit solid 
angle is proportional to , _ 8 cos ^ ( , m) 

where 


S _ o {/( g i )g* (g 1 )-/*(g 1 )y(gi)}{/(g 2) g*(g2)-/*(g !i ) g(^)} (vm 

. "{mrtoj+giWif)i)MW*Vt)+rtW(o*)Y " 

! Massey-Mohr 

i.onL /^(screened field) 


LExperi mental 
^ point 


Bartlett-Watson 


0-4 0-6 0-8 ' 1-0 

— v/C-±> 

Fig. 66. Theoretical values for the polarization ratio for gold, 
together with an experimental point. 

By means of these equations and the equations (119) and (120) derived 
for f(0), g(d ), we can estimate the order of magnitude of the effect to 
be expected in the double-scattering experiment outlined above. If 
we take both of the angles 8 1 and 0 2 to be 90° we have approximately 


*-u 


H)- 


Differentiating the right-hand side of this equation with respect to j8 
and equating it to zero, we find that 8 has a maximum when jS = 0*764; 
for this value of the velocity of the incident electrons we obtain 


=(f- 


Calculations of 8, without using any approximation, were carried out 
for gold (Z = 79) by Mott (loc. cit.) and repeated by Bartlett and 
Watson,f who have calculated somewhat more accurately the rather 
slowly convergent series involved. Massey and Mohrf have made the 
calculation for the self-consistent field of the gold atom. These results 
are shown in Fig. 66, compared with the experimental ratio obtained 

t J. H. Bartlett and R. E. Watson, Phya. Bev. 56, 612 (1939). 
t H. S. W. Massey and C. B. O. Mohr, Proc. Boy. Soc. A, 177, 341 (1941). 
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by Skull, Chase, and Myers.f No asymmetry was obtained for alumi¬ 
nium, in agreement with theory. 

57. The relativistic wave equation for two electrons 

57.1. Equations of Eddington and Breit 

Until now we have considered only the relativistic motion of a single 
electron moving in an external field, and no attempt has been made 
to discuss the interactions between two or more electrons. At the time 
of writing no complete relativistic theory has been formulated for 
the discussion of problems in which the interaction of two or more 
elementary particles is taken into account. Such a theory depends on 
a proper treatment of the interaction of a number of charged particles 
with an electromagnetic field. The difficulties encountered in formulat¬ 
ing a completely satisfactory theory of quantum electrodynamics appear 
at the moment to be so formidable that their resolution may be expected 
only along lines rather different from those so far attempted. For this 
reason quantum electrodynamics will not be discussed in this book;{ 
we shall merely indicate the results relating to the interaction between 
two electrons which have been derived from this source. 

The first attempts to set up a relativistic wave equation for two 
electrons, similar to Dirac’s equation for a single electron, were made 
by Eddington§ and Gaunt||. In the classical theory the energy of 
interaction between two electrons is 


! V 1 . V 2 


where v 1 , v 2 denote the velocities of the two particles and r 12 the distance 
between them. The first term of this expression refers to the electro¬ 
static energy and the other to the magnetic. Gaunt and Eddington 
replace this energy by 2 

r i2 

and so obtain the wave equation 

0 = K+ i n|)+(«i+ a S)m 0 c+^(ai.^)k (130) 

\ l cr ) 

where the wave-function Y has sixteen components which may be 
denoted by v ( m ,n = 1,2,3,4), 


t c. G. Skull, C. T. Chase, and F. E. Myers, Phya. Bev. 63, 29 (1943). 
j For a discussion, see P. A. M. Dirac, Quantum Electrodynamics; Comm . Dub. Inst. 
Adv. Stud . A, 1 (1943); A, 3 (1946). 

§ A. S. Eddington, Proc. Boy. Soc. A, 122, 358 (1929). 

|| J. A. Gaunt, Proc. Boy. Soc. A, 122, 513 (1929); Phil. Trans. A, 228, 151 (1929). 
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the suffix m referring to the electron denoted by 1, n to that denoted 
by 2. The matrices a), aj, are the usual Dirac matrices, but now they 
operate on the first and second suffixes respectively. As before 


n) 


h 8 e A i 
i 8x} + c j ’ 


n? 


h 8 e A2 
itef+c*’ 


(131) 


A 1 , A 2 denoting the potentials of electrons 1 and 2 disregarding their 
interaction, and 


n o=-^|+V+^ ) -^, 

ic ct c cr 


(132) 


n 0 =-m 4 . (133) 

Tt was, however, shown by Breitf that the consequences of equation 
(1 BO) are unreasonable. Making use of the quantum theory of the 
electromagnetic field developed by Pauli and Heisenberg, Breit has 
investigated the form of the Hamiltonian for two electrons. The wave 
equation corresponding to this Hamiltonian is found to be 



. n 1 +a 2 . n 2 +(at 1+ at 2 )m 0 c + 




(od.rXa 2 .r) 11 
r 2 j 


*¥ = 0. 


(134) 


This equation of Breit’s is not exact. It does not hold when the 
velocities of the electrons are too great or when the distance between 
them is too large, since the potentials are not retarded. With a slight 
modification, however, this equation has been shown to explain, with 
a fair degree of accuracy, the observed facts of the hyperfine structure 
of the helium spectrum. J 

The Breit equation (134) has been derived by Nikolsky § using Dirac’s 
theory of quantum electrodynamics.|| In this theory the Hamiltonian 
of the system of two electrons is the simple sum of the two Hamiltonians 
for the single electrons. No interaction terms are required in the 
Hamiltonian, the interaction being introduced by a wave-function T* 
which is a function not only of the coordinates of the two electrons 
but also of the variables describing the field. The interaction energy 
follows as a consequence of the quantization of the field. 

The Breit equation for two electrons is satisfactory only when cor¬ 
rections of the order ( vjc ) 2 to the non-relativistic terms are required; 


t G. Breit, Phys. Rev . (2), 34, 553 (1929). 

t Ibid.; 36, 383 (1930); 39, 616 (1932); 51, 248 (1937); 53, 153 (1938). 
§ K. Nikolsky, Phys. Zeita. d. Sowjetomion, 2, 447 (1932-3). 

|| P. A. M. Dirac, Proc . Roy . Soc . A, 136, 453 (1932). 

Z 


6072 
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v denotes the velocity of either electron. It is fortunate that an equation 
correct to this order should he sufficient for calculating the stationary 
states of two extranuclear electrons and that the present gaps in our 
knowledge of quantum electrodynamics do not make the calculation 
of spectroscopic energies impossible. It appears that the approximation 
to terms of order vjc is sufficient for the rough treatment of energy 
levels while the approximation to (vjc) 2 apparently gives a satisfactory 
account of their fine structure. 


57.2. The correspondence method of Mailer 

We have noted above that the general theory of the interaction of 
two electrons can only be based on a proper theory of quantum electro¬ 
dynamics. An important class of problems can, however, be solved in 
a more elementary way. The method is due to Holler, f 

In these problems one considers two electrons initially in states with 
wave-functions TJ(r), Xi ( r )* These are taken to be Dirac functions, and 
thus correspond to definite spin directions. We require to calculate 
the probability that a transition occurs to a state in which the two 
electrons are in states with wave-functions xp The transition occurs 
under the interaction of the electrons alone. Using non-relativistic 
quantum mechanics it is of course possible to calculate the transition 
probability to any power of the interaction. The relativistic theory 
given here gives an answer to the first power only, and is adequate 
when this is sufficient. 


The method is as follows. First one forms the charge and current 
densities associated with the transition of one of the electrons, in the 
sense of Chapter X, § 46. Using the Dirac theory, these are 
p = complex conjugate, 

j = j^e- 27nV <r *-f complex conjugate, ^ * , 

where v if denotes the transition frequency (Wi—Wfjjh, and 

* = {m) 

i fi = ce'F/a't;.. (U0) 

One then writes down the scalar and vector potentials of the electro¬ 
magnetic field that such a transition would produce. These are given, 
as in § 46, by the equations 


■hwY- Ti' 


t C. Mailer, Zeits. f. Physilc, 70, 780 (1931); Ann. der Physik , 14, 531 (1932). 
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We take the solution corresponding to retarded potentials (i.e. to an 
outgoing wave). These are 

0 _ r (W 

J | r —r' | 

" |r—r'| 

We then allow these potentials to act on the second electron, and 
calculate by the ordinary methods of perturbation theory the probability 
per unit time that the electron will make a transition to a state xp 
Owing to the term e- Mv * 1 in the formulae (l37 a) for O and A, the 
transition probability will be zero unless energy for the whole system 
is conserved. 

In the Hamiltonian for the second electron the perturbing term is 
(cf. equation (19)) 

?{0+(«.A)}. 

t/ 




To calculate the probability that after a time t a transition has taken 
place from the state x% to the state Xf we make use of formula (10) 
of Chapter X. This in our case may be written 

J J eW/,|r ‘- ril/C ^(^)Xi(»- S ) dr.dr^ 

(138) 

It is here assumed that all the wave functions are normalized to unity 
and that p(W) dW is the number of states of the final system in the 
energy interval dW. 

This formula cannot be correct, because if we make c -> oo (non- 
relativistic case) it does not tend to the correct non-relativistic result, 
using antisymmetrical wave functions. We must therefore replace it by 




. (139) 


This expression has the correct symmetry; but it will be appreciated 
that the derivation does not depend on any very consecutive argument. 


57.3. Collisions between two fast charged particles 
The method outlined above was first applied by Mollerf to the 
discussion of the non-radiative collision between two free electrons. 


f Loo. cit. 
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The accuracy of Mailer’s calculations is of the same order as that of 
the first approximation in Born’s method, so that for fast particles his 
results neglect terms of order e a /Ac in comparison with unity. They 
should, however, be exact to all orders of vjc. 

In his treatment of the problem Mailer refers the scattering to a 
Lorentz frame of reference in which the centre of gravity of the two 
electrons is at rest and their momenta are equal and opposite. The 
angle of scattering, 0*, in this frame is related to the observed angle of 
scattering, 0, by the formula 

a* (y+3)sin 2 0 t . 

0* = cos - 1 -— ) r - ' . (140) 

2+(y—l)sin 2 0 v ' 

where y is given in terms of the relative velocity of the two electrons 
before collision by the equation 


nr- 


For TJ, x Iy, Xu X/ Mailer takes plane waves. He finds that the cross- 
section for scattering between angles 0, 6+dd is 


f-j 

V»o®7 7 L(i— 


M 2 ) 2 (1 ~V)' 


(y i) s 




where we have written, for convenience 

fi = cos0*. (142) 

This formula, it will be noted, contains no terms in ft, the Planck 
constant. 

It is of some interest to compare Mailer’s formula (141) with others 
of a similar nature. For low velocities equation (141) reduces to the 
form 


I(6)dd — 477j |(cosec 4 0+sec 4 0~cosec 2 0sec 2 0)- 


-(4 cosec 4 0+3sec 4 0— 2 cosec 2 0 sec 2 0—3sec 2 0)[sin 20 dd 


Now the corresponding formula for the scattering of slow /J-particles 
(vjc 1) by electrons is (cf. §24.4) 

( e 2 \2 

-J {cosec 4 0-fsec 4 0—cosec 2 0 sec 2 0 cos T7}sin 20 d0 9 

MqV j 


where 


-logcot 0. 
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If e 2 jhc is small enough to allow us to put U = 0, while v/c is small 
enough to be neglected, the two formulae are equivalent. This is the 


case if 


1/137 < v/c < 1. 


The scattering of fast /3-rays by electrons up to angles of about 12° 
has been investigated experimentally by Williams and Terrouxf and 
by Williams. $ Instead of measuring the angles of scattering they 
measured the energy of the recoil electrons; for v/c ranging from 0-60 
to 0*97 along a track of length 18 metres they observed about seventy 
branches with energies between 7,500 and 40,000 volts. This number 
they compared with the effective cross-section§ for loss of energy 
between Q, Q+dQ f namely, 

e* dQ 

77 V 2 Q2 ’ 


which may be deduced from Holler's formula for small angles. At the 
higher velocities the observed scattering was roughly twice that pre¬ 
dicted by this formula, but the ratio diminished as vjc decreased. 

A more direct attempt to verify Moller’s formula (141) has been 
made by Champion,|| who from the analysis of about 650 m. of track 
of fast /3-particles in nitrogen, photographed by the expansion method, 
obtained 250 collisions with atomic electrons in which the angle of 
scattering lies between 10° and about 30°. The values of the ratio vjc 
ranged from 0-82 to 0*92. The number of particles scattered into the 
ranges 10°-20°, 20°-30°, > 30° was measured and the corresponding 
theoretical values obtained from Moller’s formula by numerical integra¬ 
tion. The results are shown in Table 10. 


Table 10 


Angular 

range 

Number scattered 

Observed 

Calculated 

> 30° 

10 

13 

20°-30° 

26 

30 

10°-20° 

214 

230 

Total 

250 

273 


It will be seen that the observed values and those calculated from 
Moller’s formula are in good agreement; it should, however, be noted 

t E. J. Williams and F. R. Terroux, Proc. Roy. Soc. A, 126, 289 (1929). 
t E. J. Williams, ibid. ISO,' 328 (1930). 

§ N. Bohr, Phil. Mag. 25, 10 (1913); 30, 58 (1915). 

1J F. C. Champion, Proc. Roy. Soc . A, 137, 688 (1932). 
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that most of the observed scattering angles were less than 30° when the 
contribution of the spin to Moller’s formula is very small. 

The theoretical analysis has been extended by Mollerf to the case 
when a quantum is emitted in the collision of two fast charged particles. 
The cross-section for such a process is calculated first by means of the 
correspondence method of Moller and then by means of the quantum 
electrodynamics of Heisenberg and Pauli,f and Dirac;§ the same result 
is obtained by both methods. The formula for the case in which one 
of the colliding particles is infinitely heavy corresponds to that derived 
by Bethe and Heitler|| for the radiation emitted in the collision of a 
fast charged particle with an atomic nucleus. 

The methods of quantum electrodynamics have been employed by 
Hulmef f to investigate the interaction of a particle in the nucleus of an 
atom with an electron in the X-shell (a case of some interest in the 
theory of internal conversion of y-rays). Hulme’s results show that for 
bound particles Moiler’s correspondence method gives the correct result 
if we take the retarded potentials due to the transition where the first 
particle jumps down, and then consider these as perturbing the second 
particle. For free particles, however, we may take either the advanced 
or the retarded potentials, or we may put the results in a symmetrical 
form by taking the mean of the two, without affecting the result. 

57.4. The relativistic theory of the Auger effect 

If a A"-electron is ejected from an atom, an outer electron, e.g. an 
L-electron, can fall into the X-shell giving up its energy in one of two 
ways: 

(а) By radiation, a process which gives rise to the X-ray emission 
lines. 

(б) By transferring its energy to one of the outer electrons, which is 
thereby ejected from the atom. This process is known as the 
Auger effect. Experimental investigations have been made of 
the ratio between the number of y-quanta and the number of 
Auger electrons. 

In calculating the probability of the Auger transition, one treats the 
process as a transition to a state of equal energy of the system com¬ 
prised by the two electrons. The formalism is thus exactly that of the 
foregoing section, and one uses equation (139). 
t C. Mailer, Proc. Roy. Soc. A, 152, 481 (1935). 

t W. Heisenberg and W. Pauli, Zeits.f. Physik , 56,1 (1929); 59, 168 (1929). 

§ P. A. M. Dirac, Proc. Roy. Soc. A, 136, 453 (1932). 
jj H. A. Bethe and W. Heitler, ibid. 146, 85 (1934). 
tt H. R. Hulme, ibid. 154, 487 (1936). 
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The probability of the internal conversion of the JK'-series radiation 
for gold arising from the interaction of the L T L v L Y L n , and L x L in 
shells has been calculated by Massey and Burhopf in this way. Two 
approximations arise in the course of the calculations: 

(i) The problem is treated as one of two electrons only, the perturba¬ 
tion by these of the wave functions of the other atomic electrons 
being neglected. 

(ii) Screened hydrogen-like wave functions are used for the atomic 
electrons. 

The second approximation is probably quite accurate, since it is 
known that it gives satisfactory results in the discussion of the non- 
relativistic theory of the Auger effect. J The error in the first approxi¬ 
mation cannot be estimated until the wave functions of electrons in an 
atom ionized in an inner shell are found (say, by the method of the 
self-consistent field). 

Details of the calculation are as follows: 


Suppose that an electron in the X-shell falls into an empty A-lev el 
and that the energy given up raises a second L-electron to a state of 
the continuous spectrum. If we denote by TJ, the Dirac wave func¬ 
tions of the initial and final states of the ejected electron and by x*> X/ 
those of the other electron, we find that if x ¥ f is normalized to represent 
one emitted electron per unit time the number of transitions per unit 
time is given by ( , r 

lij T*;{-^- e pi(a.a)}^dT. (146) 

In this equation p 1 and c are the usual Dirac operators. The equations 
defining a, <f> are analogous to equations (137a), viz. 

t = - f -r-Vi Xi dr '' 

J 1 1 (147) 

a = J Pl <s Xi dr' ^ 

Substituting from equations (147) into equation (146) we obtain for 
the probability per unit time of a transition 

\ A 1i\ 2 = 1 lj •^pjexp | 27n ^’ : | ; r ~ r - l j{pp , -j ■ j'} drdr’ , (148) 

where we have written, for convenience, 

P = ~ e Xf Xu ) = e X* Pi °Xu 

P = j' = eY? Pl o% 


t H. S. W. Massey and E. H. S. Burhop, Proc. Hoy. Soc. A, 153, 661 (1936). 
} E. H. 8. Burhop, ibid. 148, 272 (1935). 
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From these definitions it follows immediately that (p, j) are time- 
independent charge density and current vector for the L-+K transition, 
and that (p 9 j') are the same quantities for the L ->co transition 
(oo denoting a state of the continuous spectrum). 



The formula (148) does not take the Pauli exclusion principle into 
account. To do this we introduce the quantity | B fi \ 2 analogous to (148) 
but with the final states interchanged. The possibility of electron 
exchange is then taken into account if we take the total probability 
of occurrence of the double transition per unit time to be 

\A n -B H \*. (149) 

The application of formulae (148) and (149) to the calculation of the 
probability of the Auger transitions L v L x -> K 9 oo; 4. 4i K, oo; 
Lj, L 1U K,od has been carried out by Massey and Burhop for gold. 
The detailed calculations show the relativistic effects to be of con¬ 
siderable importance for gold; the if-series internal conversion coeffi¬ 
cient being increased by a factor of two. The relative intensity of 
the above three transitions is given in the non-relativistic approxima¬ 
tion by the ratio 1: 1*14:2*28, while the relativistic theory gives a ratio 
1:5-5:5*3. If these conclusions were tested experimentally we should 
have some insight into the validity, not only of Dirac’s equation for 
the single electron, but also of its approximate extension to two 
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electrons; no experimental confirmation of these results has so far been 
adduced. 

The theoretical values of the internal conversion (ratio of electrons 
to X-ray quanta) coefficient are shown by the full line in Fig. 67. The 
values for elements of high atomic number are deduced from the theory 
developed by Massey and Burhop, those for low atomic numbers from 
the paper by Burhop cited above, and from a paper by Pincherle.f 
There is satisfactory agreement between the theoretical curve and the 
points determined experimentally. 

57.6. The internal conversion of y-rays 

It is well known that a y-ray emitted from the nucleus of a radio¬ 
active atom may be absorbed by one of the outer electrons with the 
production of a secondary /3-ray. The intensity of the resultant /?- and 
y-rays may be observed and the proportion of y-rays reabsorbed in the 
atomic system calculated; this ratio is known as the internal conversion 
coefficient. A theoretical estimate of this quantity was first made by 
SwirlesJ and Fowler§ using non-retativistic wave mechanics. Swirles 
replaces the nuclear mechanism emitting the y-ray by an oscillating 
Hertzian dipole, radiating classically. The radiation field set up by 
such an oscillator produces photoelectric transitions in the planetary 
electrons which are then discussed on the Schrodinger theory. The 
values of the internal conversion coefficient found in this way were 
not in agreement with the experimental values. The process has since 
been treated at length by various authors|| using relativistic quantum 
mechanics and a field of the type suggested by Swirles. 

Suppose that after a radioactive transformation the product nucleus 
is left in an excited state and then falls to one of lower energy with the 
emission of a y-ray or the ejection of an extranuclear electron. Let 
g dt denote the probability that in a time dt the nucleus makes such a 
transition between two given states with the emission of a y-ray, and 
b B dt the probability that the same nuclear transition occurs with 
the ejection of an extra-nuclear electron from some X-ray level of 
the atom. Thus corresponding to each level K, L l9 we have the 

t L. Pincherle, Nuovo Cimento, 12, 81 (1935). 

t B. Swirles, Proc. Boy. Soc . A, 116, 491 (1927); 121, 447 (1928). 

§ R. H. Fowler, ibid. 129, 1 (1930). 

|| H. R. Hulme, ibid. 138, 643 (1932); H. M. Taylor and N. F. Mott, ibid. 138, 
665 (1932); 142, 215 (1933); H. M. Hulme, N. F. Mott, J. R. Oppenheimer, and 
H. M. Taylor, ibid. 155, 315 (1936); J. B. Fisk, ibid. 143, 674 (1933); C. D. Ellis 
and N. F. Mott, ibid. 139, 369 (1933); J. B. Fisk and H. M. Taylor, ibid. 146, 178 
(1934); N. F. Mott, Ann. Imt. Henri Poincare , 4, 207 (1933); Handbuch der Physik, 
24, 809 (1933); J. C» Jaeger and H. R. Hulme, Proc. Boy. Soc. 148, 708 (1935). 
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probabilities b K , b Lv b LlV .. and the total probability per unit time of the 
ejection of an electron from any shell is 

b = b K -\~b Ll +b LjI +.... 

The internal conversion coefficient is then defined by 

b 

since b+g is the total probability per unit time that the transition takes 
place. The internal conversion coefficients for the various levels are 
defined by b b 

a£= ^’ 

etc., and the ratio of the number of observed y-rays to the number of 
K -electrons is 


The choice of the name ‘internal conversion coefficient’ is unfortunate 
since we now regard the emission of electrons as due to direct inter¬ 
action between the nucleus and the atom rather than to the emission 
and subsequent reabsorption of a y-ray. The relativistic theory includes 
the effects of direct interaction as well as those of y-ray emission and 
roabsorption; in fact, no sharp distinction can be made between them. 

In making the calculations, the unperturbed system is taken to be 
an electron under the influence of a central charge Zc . We then have 
A = 0, A 0 s= Ze/r. (150) 

To determine the perturbing potentials we must make some assumption 
about the field which is to represent the y-ray emitted by the nucleus. 
Suppose the perturbing field is represented by 

A 0 - %e-^+K^ 27Tivt ) (151) 

A = ) 


and suppose that T 0 represents the normalized wave function of an 
extranuclear electron for the ground state, and that of a possible 
final state, determined by Dirac’s wave equation (19) with A and A 0 
given by equation (151). Then the probability per unit time that an 
electron is ejected because of the field (151) is given by 


b = 2 (s) # | J | 2 , (152) 

where TJ is normalized so that one electron crosses unit area per unit 
time and the summation is taken over all possible final states. The 
functions Y 0 , TJ have already been determined (§56.2 above) so that 
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once % 0 and 81 have been prescribed the problem reduces to that of 
calculating the matrix elements occurring in the series (152). 

The number of possible fields (151) is, of course, infinite; the various 
types considered with the corresponding forms of 81, 2I 0 are exhibited 
in Table 11. In this table B is a constant and q denotes 27 rv/c. 

For the dipole radiation the energy radiated by the field is 


so that for dipole radiation and K -conversion we obtain 


9 1- 


3 tic 
8ttvB* 




Y*{e%+e Pl (<U .a)}%rfr , (154) 


where T 0 is the wave function of the instate. Since both 9t 0 and |8l| 
are proportional to B , the unknown constant B is not involved in this 
expression. Taylor and Mottf showed that the number of quanta 
escaping from the atom differs from g 0 dt (the number of quanta ejected 
in the absence of the outer electrons) only by a factor of order of 
magnitude 1/137 which may be neglected, and is not (g 0 —b)dt as was 
previously supposed. The effect of the extranuelear electrons is thus 
to increase the total nuclear transition probability per unit time. 

The evaluation of the matrix elements involved in equation (154) has 
been carried out using the wave-functions My, %, which are the exact 
solutions of the relativistic wave equation for an unscreened nuclear 
field of potential Ze/r. The possible sources of error are thus: 

(i) the assumption that the dipole, quadripole, or other field used 
is the correct one; 

(ii) the neglect of screening; 

(in) the use in the integrations (152) and (154) of the dipole and 
quadripole forms for the field of the y-ray at all distances from 
the centre of the atom; this is obviously incorrect within the 
nucleus (r < 10~ 12 cm.). 

The second of these errors is probably not important since the energy 
of the ejected electron, even for soft y-rays (^ 150 kV.) is always much 
greater than the difference between the screened and unscreened 
potential energies anywhere in the atom. The third source should not 
introduce an error greater than the small fraction 


radius of nucleus 
wave-length of y-ray’ 


(155) 


unless the field within the nucleus is unexpectedly great. 


t H. M. Taylor and N. F. Mott, Proc. Pmj, Soc. A, 142, 215 (1933). 
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Conversion in the K-shell. The results of the calculations for dipole, 
quadripole, and magnetic dipole radiation are shown in Fig. 68. 
Magnetic quadripole and octopole radiation give considerably higher 
values. Experimental points are shown on the same diagram. 



I_i_ i --- i 

0*1 0*2 0*3 0-4 0-5 0-6 


Energy (millions of volts) 
Fig. 68a. 



Conversion in the L-shell. The internal conversion coefficients for 
quadripole and dipole radiation are identical for very hard radiation 
(i hv ->oo). For dipole radiation Hulme has shown that in this limit 

a 7r :a Li :a Ln :a Lnr “ 6*7:1:0*0080:0*044 
and that for hv — 7*2 x 10 6 eV. 

a ^ :a Li ” 7*0:1. 
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Fisk has shown that for quadripole radiation this ratio remains 
practically unaltered down to energies of order hv — 2*5 x 10 5 eV. 

The ratio oc K /oc Ll has been estimated by Ellisf for a number of lines 
and found to be roughly in agreement with the theoretical value 7*0. 
Gray and 0 ’Leary$ find that for RaD in the ultra-soft region 
(hv = 0-475 X10 6 ) 

-J^ 1 - = 2-2. 

1_ «i* 

For dipole radiation the theoretical value is 1-8 and for quadripole 2-75. 

The experimental determination of the internal conversion coefficient 
is open to appreciable errors, but the data seem to exhibit the following 
features: 

(i) The experimental values plotted against hv may be fitted to two 
smooth curves. 

(ii) These two curves have the same general form as the theoretical 
curves for dipole and quadripole radiation, but in the soft y-ray 
region (hv < 3.10 5 eV.) the theoretical values are more than 50 
per cent, too small. 

A detailed analysis of the experimental results for elements of odd 
mass number shows that for ordinary dipole transitions (A Z = 1) the 
magnetic quadripole terms are weak in comparison with the electric 
dipole terms, the ratio of the intensities being of order /c 4 , where k is 
defined by equation (155). Even for the hardest y-rays this ratio is 
only about 10“ 4 , so that we should expect transitions for which A? = 1 
to have almost exactly the theoretical coefficient derived on the assump¬ 
tion that the field is due to a dipole. In the quadripole transitions 
(A l = 0) the intensities of the magnetic dipole and electric quadripole 
terms are of the same order, while the magnetic octopole terms are 
negligible. The ratio of the intensity of the magnetic dipole terms to 
the electric quadripole terms varies roughly as the square of the wave¬ 
length of the y-ray and consequently increases for softer y-rays. Thus 
if the soft y-rays arise from transitions between the various states of 
a multiplet level, then the internal conversion coefficient in the region 
of long wave-lengths approximates to the theoretical magnetic dipole 
internal conversion coefficient curve, and in the region of short wave¬ 
lengths should fall below this, and lie near to the corresponding 
theoretical curve for electric quadripole radiation. 

t E. Butherford, J. Chadwick, C. D. Ellis, Radiations from Radioactive Substances, 
p. 362 f. 

t J, A. Gray and A. J. O’Leary, Nature , 123, 568 (1929). 
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58. The Dirac equations in spinor form 

We noted in §53.6 that although Dirac’s equation for the electron 
is invariant under a Lorentz transformation, it can only be written in 
a tensor form in an artificial way, because of the fact that all the 
representations of the Lorentz group are not included in the tensor 
formalism. The necessary extension of the tensor calculus—the spinor 
calculus—was given by van der Waerden ;*)* with this calculus the equa¬ 
tions may be written in an automatically covariant form. 

Any pair of quantities (£ x ,f 2 ) which transform according to the 
formulae 


where the coefficients satisfy the condition 

I O11 o 


11 ^12 

da 1 dt 


*21 ^22 I 

is called a spinor of the first rank and is denoted by £ k (k 
Similarly any two numbers which transform like 

^i = 0*1771+0*2 ^ 2 ’ = a *i 12 


(150) 

(157) 
1 , 2 ). 

(158) 


form a spinor which we shall denote by 77 ; (r = 1 , 2 ). 

The contravariant spinors of the first rank £ k , -rf are defined by the 
relations ^ _ g 

l l = lb V 2 = —lb 

which ensure that the scalar products t; k £ k , tf iq- r are invariant. 

From these two types of spinors we can build up spinors of higher 
rank. Thus any four quantities transforming like the products 
£ifi> £1 £ 2 * £2 £ 1 , £2 £2 f° rm a spinor of the second rank which we shall 
denote by Similarly second rank spinors of the type rj- can be 
defined. We may also define *mixed 5 spinors of the second rank as 
quantities transforming like a product of a £ and an 77 denoted by l* rk . 
Analogously we can define spinors of higher rank. The main rules of 
operation of spinor calculus are that it is not necessary to fix the 
relative position of dotted and undotted indices of the same spinor, 
and that the complex conjugate of any spinor equation is obtained by 
replacing all the dotted indices by undotted indices and vice versa. 
Two identities of great use in spinor analysis are 

a k b k = ~a k b k ; (159) 

a k K <*+ a m b k c k +a k b k c m = 0; (160) 


t B. van der Waerden, Gdtt. Nacht . 1929, 100. See alflo O. Laporte and G. E. Uhlen- 
beck, Phys, Rev. (2), 37, 1380 (1931). 
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their proof follows at once from the definitions. These rules also hold 
for dotted suffixes. 

We now introduce a spinor operator d- a defined by the equations 


-at 


d' 12 = 


fix* 


dx» 


■af = = 


dx. 


A- 


u n ■ 


b_ 

3 @ x 4 

C)Xn C)Xm 


Now it is obvious from the transformation equations (41) and (42) 
that the four scalar wave-functions x i[, Hg, X F 3 , H 4 of the Dirac theory 
form two spinors of the first rank which we shall denote by and xi* 
The analogue of the four-vector A^ (fx = 1, 2 ,3,4) will now be a mixed 
spinor <f> l7k . It follows immediately that the Dirac equations may be 
written in the spinor formf 

= 0 , (161) 

m 0 cXh+pixx = ( 162 ) 

where we have introduced the abbreviation 


(103) 


(164) 


Eliminating \i from (161) and (162) we obtain 

= 0. 

Now by means of the identities (159) and (160) we have 

Ap 6 x% = 

and from the definition (163) of pf we have 
where g^ is defined by 

9mb ~ ®b\ ^rkX 4^’ 

Thus the second-order wave equation (164) reduces to the form 

11e»*+<£ dA )(f (166) 

t The summation convention is used throughout, i.e. when a suffix occurs twice the 


(165) 
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The left-hand side of this equation is identical with that of the spinor 
form of the Klein-Gordon wave equation. The term on the right-hand 
side represents the spin correction. 

In the spinor notation the Darwin Lagrangian (56) may be written 
in the form 

L = J{TAgAT A - X ^ XA -^0AY. +x AaA x . }+ 

+ 2m 0 c(T A x a +T*x*)+^(T a Y a + x - Xa )-^.( 167) 
where the quantities g^ p are defined by equation (165). 

Considering I as a function of x ¥ ) 4> an d their derivatives, and 

forming the Euler-Lagrange equations as before, we obtain the Dirac 
equations in spinor form together with the Maxwell equations with the 
Dirac current. In this way we obtain spinor expressions for the current 
and the charge density. 


59. Relativistic wave equations for particles of spin greater than 

a half 

Relativistic wave equations for particles of any spin have been 
derived by Diracf and by Pauli and Fierz,J on the assumption that 
each of the scalar wave functions satisfies, in the absence of interaction, 
the generalized second-order wave equation. The resulting equations 
connect two irreducible spinors but may be split by a suitable trans¬ 
formation into two sets. The simplest case in the Dirac scheme (apart 
from the Dirac equations of the electron) is that for a particle with 
twice the electronic spin. In this case we have for a particle of non¬ 
zero mass m 0 in the absence of an electromagnetic field the spinor 
equations 

2 P aK AKX = V2 m 0 B% 2 p &K Bi = (l 68 ) 

These equations relating the field quantities J3^, A K \ are analogous to 
the Maxwell-Lorentz equations of the electromagnetic field, an essential 
difference being that all the quantities are here assumed to be complex. 
The transition of these equations from spinor to tensor form has been 
effected by Kemmer§ who showed that there are four possible ways of 
writing the equations (168) in tensor form. They are 


d(f> 

dx a 




!§£—*■ 


(169) 


t P. A. M. Dirac, Proc. Roy. Soc. A, 155, 447 (1936). 

t W. Pauli and Fierz, Helv. Phys. Acta , 12, 3, 297 (1939); Proc . Roy. Soc . A, 173, 211 
(1939). 

§ N. Kemmer, ibid. 166, 173. 

6072 Aa 
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2 - 1 ?=^ < m > 


and finally 


(with <£ a(3 = -fyj, 


8 <f>fiyS_Hc^ , e< £ a jSS fygfo _ V V^ 8 _ .^ y 8 

1ST &e fl + ar., “ KXa ^ 8, Z 8x„ - Kf ’ 


a -1 a 


with </>cxp y = ^jSay = *“^ayj5- ( 173 ) 

In all these equations a, /?, y, 8 run from 1 to 4 and 

* — m 0 c/ft. 

It is immediately obvious that equations (169) are equivalent to the 
Klein-Gordon equation 4 

2SI-* <™» 

a=*l “ 

Equations (170) are those proposed earlier by Procaf for Bose-Einstein 
particles (i.e. particles of spin one). The analogous equations (171) and 
(172) had not been considered prior to Kemmer’s analysis. From these 
equations Kemmer develops four inequivalent possible forms of field 
theory for Bose-Einstein particles. 

It was pointed out by Kemmer J and Duffin§ that the Proca equations 
(170) could be written in the form 

22%f = 

v,p r n F 

if then we construct a column matrix T whose ten elements are the 


components of the vector <f> and the distinct components of the tensor 


Xajs, the Proca equations assume the form 


U«1 r 1 

(176) 

r A 

here the satisfy the relations 


PpPvPp+PpPvPp = PpiKp+Pp 8 p.v> 

(176) 


which are analogous to the relations = 28^ in the Dirac 

theory. 

t J. Proca, J . Phys, Radium (vii), 7, 347 (1036). 

} N. Kemmer, Proc. Roy . Soc. A, 173, 91 (1939). 

§ R. J. Duffin, Phys. Rev. 54, 1114 (1938). 
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The algebra defined by the equation (176) has been examined in 
detail by Kemmer. | Just as in the case of the algebra defined by the 
Dirac a-operators there are 16 linearly independent quantities among 
the four ex’s and their multiple products, in the algebra of the ^-operators 
there are 126 independent elements of the algebra formed from the 
multiple products of the four j8’s and the unit matrix. The /3’s may be 
represented by 10 X 10 Hermitian matrices. J 

A question which arises naturally out of these investigations is 
whether the behaviour of elementary particles of any integral or half¬ 
integral spin can always be described by an equation of the type 

{Pk<*+lflY = 0, (177) 

where the p k are the usual differential operators, x * s a constant, and 
the quantities oc k are four matrices describing the spin properties of the 
particle. This problem has been investigated by Bhabha,§ who suc¬ 
ceeded in showing that for particles of spin greater than 1 the equations 
of Dirac, Pauli, and Fierz cannot be written in the form (177). Bhabha 
postulates that the fundamental equations of the elementary particles must 
be first-order equations of the form (177) and that all properties of the particles 
must be derivable from these without the use of any further subsidiary 
conditions. As a result of this assumption it is found that the scalar 
wave functions do not satisfy a second-order wave equation as in the 
theory of Dirac, Pauli, and Fierz, but Bhabha considers his assump¬ 
tion to be more logical than theirs. He finds that each component of 
the wave function satisfies an equation of order greater than two which 
can be written in the form of products of the operators of the usual 
second-order equation. A consequence of these equations is that we 
must interpret every particle of spin greater than one as possessing not 
one but several values of the rest mass. These states of higher rest 
mass are a necessary feature of the theory. It is shown that these 
higher values of the rest mass are simple rational multiples of the lowest 
mass. For example, particles of spin f, if they exist in nature at all, 
would be expected to appear with two values of the rest mass, m and 
3m, the former being the stable rest mass. Particles of spin 2 should 
they exist, would appear with rest masses m, 2m. 

f N. Kemmer, Proc. Camb. Phil. Soc. 39, 189 (1943); see also E. Schrodinger, Proc. 
Roy . Irish Acad. 48, A, 135 (1943). 

X N. Kemmer, loc. cit. 

§ H. J. Bhabha, Rev. Mod. Phys. 17, 200 (1945). 
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60. Introduction 

In this book we have employed, the methods of wave mechanics as 
developed by Schrodinger and others. Although these methods are the 
most suitable for the applications of the theory to the simpler problems 
of physics and chemistry they were not the first to be put forward to 
replace the older quantum theory of Bohr and Sommerfeld. The first 
system of equations designed to that end was set up slightly earlier 
by W. Heisenberg and developed by Born, Jordan, and Dirac. Because 
of the fact that the quantities occurring in this system of equations 
obeyed the laws of matrix algebra, the theory became known as ‘matrix 
mechanics’. This formulation of quantum mechanics is mathematically 
equivalent to wave mechanics, but since it is sometimes more con¬ 
venient to make use of the results in the matrix form we give here 
a very brief outline of the foundations of matrix mechanics without 
going into the detailed solution of special problems. The relation of the 
Heisenberg equations to the Schrodinger equation is also indicated. 

The theory of matrix mechanics was developed from the phenomeno¬ 
logical point of view; from the beginning Heisenberg adopted the 
principle that only physical quantities which are actually observable 
should be incorporated in the new theory. Heisenberg proposed to 
build up a theory solely in terms of observable properties of atomic 
systems such as the frequencies, intensities, and polarizations of the 
radiations emitted by such systems and omitting concepts of doubtful 
observability such as the positions and velocities of electrons. The 
relation of matrix mechanics to classical mechanics is more direct than 
that of the Schrodinger theory because the transition from the classical 
theory to the quantum theory is made here at an earlier stage. The 
transition from classical dynamics to the Schrodinger wave mechanics 
corresponds to a later stage in the development of the classical theory; 
indeed, Schrodinger’s method consists in replacing the Hamilton-Jacobi 
equation of classical theory by a wave equation suitable for the dis¬ 
cussion of atomic processes. ; 

We shall begin by recalling some of the repults of classical mechanics. 

61. The classical Poisson bracket J 

In classical mechanics a dynamical system possessing n degrees of 
freedom is completely specified by the generalized coordinates q { and 
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( 1 ) 


the generalized momenta p { (i = Any other dynamical 

variables of the system, say, F , can be expressed uniquely as 

functions of these generalized coordinates and momenta. From any 
two such dynamical variables F , G the expression 

V 00\ 

may be formed; such a quantity will be called a classical Poisson bracket 
and be denoted by [F, G], It follows immediately from the definition 

['•*)--g. < 2 > 

so that in classical mechanics the differentials with respect to the 
variables p i may be replaced by a Poisson bracket. In particular 
we may write Hamilton’s canonical equations 

dH . dH y m 

*--<* ./ <3) 
in the form &+[#>?<] = °» Vi+[ H >Pi\ = °- ( 4 ) 

The importance of the Poisson bracket in classical mechanics arises 
from the fact that it is invariant under a contact transformation. It 
follows at once from the form (4) of Hamilton’s canonical equations 
that they too are invariant under a contact transformation. 

The equations (4) may be generalized for any dynamical variable, F, 
which does not contain the time explicitly; thus 

* s?(dF.,dF.\ ^IdFdH dF dH\ r VTJ] 

* -1 k'+w ,*)-1 k *-w, wJ - l6) 

Many of the most interesting and useful properties of the classical 
Poisson brackets follow immediately from the definition (1). For 
example, the equations 

br»ft] = [«r.Sj 

= S rs 


°) 


( 6 ) 


are readily established. The dynamical variables occurring in classical 
mechanics can usually be expressed as polynomials (or power series) 
in the coordinates q i and the momenta p i9 so that Poisson brackets 
involving them can be calculated from rules of the type 

[J’ ) (? 1 +« 2 ] = [JP ) G' 1 ]+[7’ j (7 2 ] ) (7) 

[F, Q l GJ = [F, 0^+Oj[F, 0,1 (8 a) 

. [F 1 F 2 ,0] = {F 1 ,ff}F 2 +F 1 [F 2 ,0l (86) 

all of which follow from the definition (1). 
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62. The quantum Poisson bracket 

We shall now investigate the analogue of the Poisson bracket in 
quantum mechanics. To do this we consider some of the properties 
of the classical Poisson bracket to see which of them may reasonably 
be taken over into the formalism of quantum mechanics. It follows 
immediately from (8 a) that 

[«, 0j = w+0i[«> 0j 

and from (86) that the right-hand side reduces to 

W. 0iW 0 1 ]O i +G 1 [F v G^+G.F^, G,}. 

Similarly by applying the theorems in the reverse order we can show 
that 

[FiF % , G :l G 2 ] = [Ji, G,}F,+FIF 2> Q{\G 2 +F 1 G x [F„ G 2 \ 

Equating these two values of the Poisson bracket [F 1 F 2 , O x G 2 ] we 
obtain the relation 

F l a l -G l F t _ f 2 g 2 -g 2 f 2 
0J ~ W,0J ' 

In classical mechanics this result is trivial since the numerator of each 
fraction vanishes identically; but we have seen that in quantum 
mechanics where the dynamical variables are in general operators, 
FG—GF is not always zero. In such a theory we might then interpret 
equation (9) as meaning that the Poisson bracket [jF, G ] is proportional 
to the expression FG— GF . The constant of proportionality in a special 
case was first calculated by Diracf, who considered the expression 
FG— GF of the quantum theory and found the corresponding formula 
of the classical theory to which it must tend in the limit for large 
quantum numbers. In the classical theory the F and G are functions 
of the generalized coordinates and momenta of a multiply periodic 
system. Dirac found that for large quantum numbers FG—GF corre¬ 
sponded to the expression ifi[F, G ] of classical mechanics. Generalizing 
this result he postulated that the equation 

FG-GF^ ih[F,G] (10) 

holds in all circumstances. Equation (10) is the basic axiom of the 
quantum mechanics formulated by Dirac and can have no formal proof. 
Its sole justification is that its use leads to results which are consistent 
with the experimental evidence and that it reduces to the classical 
equation FQ = QF (n) 

t P. A. M. Dirac, Proc. Boy. Soc. A, 109, 642 (1925). 
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in the limiting case ft -> 0. It is assumed that the Poisson bracket 
[F, 6?] is calculated formally as in classical mechanics and obeys the 
same laws as the classical bracket; equation (10) then supplies informa¬ 
tion about the difference between the two products FO , OF of the two 
dynamical variables F and 0 in the quantum theory. 

The quantum Poisson bracket is defined by the relation 

[F,G] = ±(FG-GF)-, (12) 

it can easily be verified that it has the following properties 

[F,F] = 0, (13) 

[F,c] = 0 (c, scalar), (14) 

as well as the properties (7) and (8). 

63. Angular momentum relations for a single electron 

To illustrate the method of calculation of Poisson brackets in quan¬ 
tum mechanics we shall evaluate the brackets corresponding to certain 
of the dynamical variables connected with the motion of a single 
electron. If the electron has coordinates {q^q^q^) and linear momenta 
(PvPz*Pz)> we ass ume that the relations 

\3i> Qj] = [Pi’Pj] = °> [?<• Pj] = 8 « (*>j ==1.2,3) (15) 

of classical mechanics hold good in quantum theory. 

In classical mechanics we define the angular momentum m of a single 
electron by the vector equation 

m = qxp, 

so that its components are 

(%.«!>%) = ((h.Pz-<i3P2’ r hPl-<llP3, f hP2-<l2Pl)- 
From the definition of m it follows that 

= [<7i> gJPk+ftbi. <h\Pi-'h[Pv <h\ 

using the rules (7) and (8). If we assume the relations (15), each of the 
Poisson brackets vanishes except = —[qvPi] — —1, so that 

K.?i] = ?*■ 

Similar calculations lead to the results 

= 0 [ m V ft] = ft [«H>ft] = “ft 

I> 2 >ft] = -ft [m 2 ,g 2 ] = 0 [m 2 ,q 3 ] = ft 

Oft. ft] = ft [m a ,q 2 ] = -ft [m 8 ,ft] = 0. 


( 16 ) 
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Writing r 2 = ql+ql+ql and making use of equations (8) and (16) we 
obtain the result 


[ TOl ,r 2 ] = ? 1 [m 1 ,g 1 ]+[m 1 ,g' 1 ]g' 1 +g 2 [m 1> g' 2 ]+[TO 1 ,? 2 ]y 2 4- 


= 0, 


+S'3K.?s]+[>»i. A 


giving the relations 

[m x ,r 2 ] = [m 2 ,r 2 ] = [m 3 ,r 2 ] = 0. (17) 

Now it can easily be shown from the definition of a Poisson bracket 
in classical mechanics (equation (1) above) that, if [F, (?] = 0 for any 
two dynamical variables F , G, then [F, <£((?)] = 0, where <j>{G) is any 
function of G . If we assume this result to be carried over into the 
quantum theory we have from (17) 


[m v <l>(r)] = [m 2 , <f>(r)] = [ra 3 , <f>(r)] = 0. 
Corresponding to the set (16) we have the set 

= 0 = Pa D»i>Ps] = ~Pi 1 

[«* 2 .Px] = ~Pa = 0 K-Ps] = Pi 

[™a,Pi] = Pa \. m a >Pt\ = —JPi l^s-Ps] = 0 > 


( 18 ) 


(19) 


These identities may be easily proved; for instance, 


\™a<Pi\ = [<hP-i-<IiPvPi\ 

= bi.ft]p*+jri[f»«.i»i]-[yi, pJpi— 


“ft- 

By a method similar to that employed in establishing the relations (17) 
we can show that 

[wij.p 2 ] = [m 2 ,p 2 ] = [m a ,p 2 \ == 0, (20) 

and hence that 


[«*1. x(l>)] = [™2> x(l>)] = K> xb)] = 0 , (21) 

where x(f) is a function of p. 

For an electron moving under the action of a central force the 
Hamiltonian function, H, may be written in the form 

B-i-y.+rw, 

where p denotes the mass of the electron. Hence by equations (18) and 
(21) we have 


[m v H] = [m 2 , H] — [m 3 , H] = 0. 


(22) 
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Furthermore, from the definition of the angular momentum we have 
= [m 1 ,q 3 p 1 —q l p 3 ] 

= [«*i. ijA+yJ:"*!. ft]—[»»!. 


= <hPi— ( hVi = «V 
In this way we establish the relations 

[m v m 2 ] = m 3 , [m 2 ,ra 3 ] = m v = ra 2 , (23) 

from which we may readily deduce 

[m v m 2 ] == [m 2 , m 2 ] = [m 3 ,m 2 ] = 0. (24) 

We have established the relations (16) to (24) by means of the formal 
properties of Poisson brackets in classical mechanics. It is also of 
interest to note that they can be established simply if we make the 
assumption, implicit in the Schrodinger theory, that 


h d 


We then have m 3 = ^-(q 1 — — q 3 — 

3 i\ Hl 8q 2 J2 dqj’ 

and so, by equation (12), 

[m 3 , ry 1 ]'F = ^(w. 3 7 1 -( ?1 ot 3 )'F 



The right-hand side reduces to <7 2 Y, so that the effect of operating on 
the wave-function Y by [ra 3 , q x \ is equivalent to multiplying it by q 2 . 
This may be written formally as 

in agreement with the relations (16). The other relations can be 
established in a similar way. 


64. The angular momentum of a system of electrons 

If the dynamical system consists of several electrons, then the angular 
momentum of the system is taken to be the vector sum of the angular 
momenta of the individual electrons. Thus, if there are n particles with 
angular momenta m (?:) (£ = 1,2,..., ft) the total angular momentum of 
the system is defined by the vector equation 
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If i ^ j, then it follows at once from the definitions that 

= 0, [ m i\ m 2 ) ] ~ 0 (25) 

for i, j = 1,2,...,n. Now for any particle i 

= 1 = K>>#I = 

In this way it may be established that, for i = 1, 2,..., w, 


[M v qV] = 0 

w,#] - & 



1 

II 

SrH 

= 0 

= qf | 

. (26) 

[^3,#] = gif> 

[M 3 ,q^] = -qf 

[M 3 ,qf] = 0 j 


The relations 

[M v p^] = 0 

[M v p^] = pf 

[ M vPa } ] = —P ( 2 ] 


to* 

II 

1 

M S 

[M t ,pf] = 0 


(27) 

[M s ,p?] = P <i> 

[M 3 ,pW] = 

= o J 


(i = 1,2,...,%) corresponding to the set 
established in a similar fashion. 

of equations (19) 

can be 

Furthermore, 

[M v M 2 ] 

= [ j m«>, 2 «?] = 

K = 1 j = l J 

i f,[m { i ) ’ m< i ) J- 


Now by definition 

= m,* 8 y , 


so that 


m 3 , 



giving one of the relations of the set 


[Jf x , = M Zi \M 2 , M z ] = Jf l5 [iif 3 , JfJ - ilf 2 , (28) 

the others being established in the same way. 

Finally, if M 2 is defined by 

M 2 = Ml+Ml+Ml 
it can be shown from equations (28) that 

[M v M 2 ] - [if 2 , M 2 ] - [if 3 , M 2 ] - 0, (29) 

also that [M if H] = [H, M 2 ] = 0 (i = 1,2,3). (29 a) 

Thus the angular momentum identities hold not only for a single electron 
but also for a system of electrons. 

65. The equations of motion f\J 

We have seen that in classical mechanics the Hamiltonian equations 
of motion (3) may be written, in terms of the classical Poisson brackets, 
in the form (4). We now postulate that in quantum mechanics the 
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equations of motion are of the same form except that now the classical 
Poisson bracket is replaced by the quantum Poisson bracket defined 
by equation (12). In this way we obtain the equations of motion 

= jiHqj-qjH), p i = ^(Hpj-pjH) (30) 


(j = 1,2,...,w) for a system with n degrees of freedom. In addition to 
the equations of motion we require new quantum conditions to replace 
the conditions r 

fPjdqs^njh (31) 

of the Bohr theory of the atom. These are obtained by assuming that 
the Poisson brackets involving the coordinates q i and the momenta p { 
have the same values as in the classical theory. Combining the classical 
equations (6) with the definition (12) of a quantum Poisson bracket 
we obtain the quantum conditions 


PjPj PkPj = 0 • (32) 

<]}Pk-Pk<1j ih hk ; 

This method of deriving the quantum conditions (32) is due to Dirac;f 
they have also been derived from (31) by a method making use of the 
correspondence principle4‘ 

The variation with time of any dynamical variable which does not 
contain the time explicitly is taken to be defined by the classical 
equation (5) with the Poisson bracket replaced by its quantum analogue 
(12), i.e. j 


[F,H] = -—(FH—HF). 


(33) 


For this reason any dynamical variable, a, which satisfies the equation 
ih[<x 9 H] = ocH-Hoc = 0 (34) 

is called a constant of the motion. For example, equations (22) show, 
that the components of the angular momentum of a single electron are 
constants of the motion for a central field of force. The Hamiltonian 
is itself a constant of the motion. 


66. Matrix mechanics 

So far we have not specified the nature of the entities q p Pj satisfying 
the commutation relations (32), though it is obvious from the third of 
these equations that they are not ordinary numbers. We have seen 

t P. A. M. Dirac, Proc . j Hoy. Soc. A, 109, 642 (1925). 
t M. Bom, W. Heisenberg, and P. Jordan, Zeits ./. Phys. 35, 557 (1925). 
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previously, however, % that an equation of this type is satisfied if we 
take ^ ^ 

Pi ~idq j 

and interpret it as meaning that the effect of applying the operator 
QjPj—PjQj a function of the p’s and q’& is merely that of multiplying 
by ih. We can, however, interpret the quantities q j9 p,j in quite a different 
way. In the classical theory the coordinate q in any system varying 
periodically with the time with period v can be represented by a Fourier 
series of the type 

q= I q(j)e 27rm , 

j = —oo 

where, to ensure that the coordinate q is real, the coefficients q(j) must 
satisfy the relations = (35) 

for all values of j. In the quantum theory the classical frequencies jv 
are replaced by the spectral frequencies v. nm which satisfy the equations 

v nm~^~ v mk ^ v nk> 

Vnn = °> ( 37 ) 

v mn “ v nrrr 

In setting up a matrix formulation of quantum mechanics, therefore, 
we form, instead of the Fourier series, the matrix of which the elements 

are (m\q\n)e 2iriv ’»» t . (39) 

The quantities (m\q\n) replace the numbers q(j) of the classical theory. 
The matrix element (39) corresponds to the transition of the system 
from the state m to the state n. 

Since v nn = 0 for all n, it follows that the elements along the leading 
diagonal of this matrix are constant with regard to the time. Further¬ 
more, if the matrix is to represent a real coordinate, then corresponding 
to (35) we must have (fl||j|n) = {n]q | m) * (40) 

Matrices with this type of symmetry are called Hermitian ; all the 
coordinate matrices of quantum mechanics are assumed to be of this 
type. The matrix with elements (n\q\m)* is called the Hermitian con¬ 
jugate of q and is usually denoted by q\ A Hermitian matrix is thus 
one satisfying the equation q* ~ q 

If we have two coordinate matrices 

?i — ||(m|g 1 |»)e* rf *"‘|| I 
q 2 = ||(OT|g 2 |w)e 2 ^'”“'|| > 

X p. 36 above. 
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= || 2 (m\q 1 \k)e Mvmt, (k\q 2 \n)e 2 ‘ niv ^‘ || 

= II 2 {m\q 1 \k)[k\q 2 \n)e l ' niVm '' i || 

11 k 11 

by equation (36). On the other hand, 

Mi = || 2 (m}q i \lc)(k\q 1 \n)e 2ni '” n ’ t ||, 

11 k 11 

so that in general q 1 q 2 # q 2 In the case (M 2 = (Mi fh e matrices 

g 1} g 2 are said to commute. 

The elements of the matrices q x q 2 and q 2 q x have the same time factors 
e 2 TTivmnt as an( j q 2# The exponential time factors are usually omitted 
from the matrices; it is to be understood that when an element is 
written (m\q\n) it means (m\q\n)e 2niVmnt . For the equation (33) to have 
any meaning in matrix mechanics it is necessary to define the differen¬ 
tiation of a matrix with respect to the time t. We will assume that the 
time derivative q is represented by the matrix 

q= 11 Zniv mu (m\ q\ri)e 2lTiVmnl | ] (41) 


obtained by differentiating each of the elements of the matrix q with 
respect to t. If the matrix q is constant throughout time, all the 
elements of q must vanish. 

Now by its definition v nn = 0 , so that whatever the value of (n\q\n) 
the diagonal elements of the matrix q will be zero; but v mn (m ^ n) is 
not zero, so that the non-diagonal elements of the matrix q can only 
be zero if (m\q\n) = 0 (m ^ n). 

In other words, the matrix q is a diagonal matrix of the form 

'(1|?|1) 0 0 

0 (2|g|2) 0 . . 

0 0 (3|g|3) . . 


(n\q\n)h u 


Since the Hamiltonian is constant during the motion it must be 
represented by a diagonal matrix 

(i\H\j) = H^. (42) 

It follows immediately that any matrix which commutes with H is a 
diagonal matrix, for if a is such a matrix 

0 = (m\otH—Ha\n) = ^ {(m|a|fc)(fe|H|ri)-“(m|H|4)(/?|a|7i)} 

= (H n -H m )(m\cc\n), 
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which is satisfied only if 

M«l») = 


67. The solution of special problems 

The solution of any problem in matrix mechanics is complete when 
a set of matrices (m\qj\n) has been constructed to represent the 
dynamical variables q satisfying the conditions: 

(i) The matrices are of Hermitian type. 

(ii) They are such that the Hamiltonian matrix derived from these 
matrices is diagonal. 

(iii) They satisfy the commutation relations (32). 

The further conditions are summed up in the equations of motion 

F = ~ ( FH-HF ) 


satisfied by any dynamical variable F. 

In the condition (iii) is interpreted as the infinite matrix 


8 


a — 


0 10 . 

0 0 1 . 


8^ is known as the Kronecker delta. 

To illustrate the use of the matrix method we shall discuss two simple 
problems in which the purpose is to obtain matrix representations of 
constants of the motion in diagonal form, thus obtaining their allowed 
values in quantum mechanics. 


(a) Rigid body rotating about a fixed axis 

Consider the motion of a rigid body capable of rotating about a fixed 
axis. If the body is free from external forces its angular momentum p$ 
and its total energy H remain unaltered during the motion. Since the 
angular momentum is a constant of the motion, we may represent it 
by a diagonal matrix 

If the angular displacement of the body is denoted by 0, then the 
configuration of the dynamical system is repeated whenever 0 is in¬ 
creased by 2mr, where n is any integer. In the equation 

^ = [F,Pe]-%(PeF-Fpe)> 

vre may replace F by e inS and obtain 

(p g e in8 —e in9 pg) — nhe inB , 


(j\Pe\k)-= Pfiik- 


(43) 
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or, in matrix form, 

^{U\PeMMe^\l)~(j\e^\m\Pe\l)} = nh(j\e^\l). 

Using the fact that ( j\po\k ) is of the form (43) we find that the sum 
on the right reduces to a simple multiple of (j\e in& \l) to give the relation 

Vi~Vi = **> 

showing that p i = ( j+ot)h, 

where j is an integer and a is a constant. To show that a is, in fact, 
zero we need only note that positive and negative values of j in this 
equation relate to opposite senses of rotation, but to the same numerical 
value of the angular momentum; that is, 

(j+<x)h = — (— j+a)h 
or oc = 0. 

Finally p } = jh, 

a result which can readily be established by the methods of wave 
mechanics. 


(6) The harmonic oscillator 

Suppose that the coordinate of the particle is specified by a matrix q 


with elements 


O'Iff I*) = (j\a\Jc)e Mv *‘. 


(44) 


The Hamiltonian of a harmonic oscillator is of the form 


H = ^> 2 +i*ff 2 - 

The canonical equations are then 

ff = pIp, p — —i<q- (45) 

Eliminating p from these equations, substituting for q from equation 
(44), and writing k = we obtain 

^(v 2 0-yh)um = 0 . 

This equation is satisfied if we take 

(j\a\k) = 0 (fc#j=F 1), • 

v ij +1 = ~~ v o> — v o- 

Now from the former of equations (45) we have for the matrix p 

(j\p\k) = 2mipv jk (j\a\k)e 2miv » i , (47) 

so that 


i3\pq—qP\ m ) = 2m> £ (v jk —v k Je iniiv * +v ^(j\a\k){k\a\m), 
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and by the commutation rule (32) this must equal Substi¬ 

tuting m = j, making use of equations (46), and of the result 


we obtain 


w jk 


~ v kp 


(j|a|j+l)(j+l l«l j)-(jl“l j—l«l j) 


4^0 ’ 


which may be written 


IO’+i|®b")l 2 —l(j|«lj—i)l 2 = 

>n-v 
jh 


fi 


4tTTfLV 0 

If we assume that |(1 |a|0)| 2 is the first non-vanishing term of the series 
we obtain 

I0>|j~l)| 2 : 


47 T/jLVq ’ 


so that the matrix q has elements 


(j\q\k) = 0 (k^j± 1), 

—■*)= 

(iljli+i) = 

where the /Ts are arbitrary. 

The elements of the matrix p are then deduced from equation (47). 
For the energy levels of the system we have 

(n\H\n) = ~{n\q 2 +^vlq 2 \n) 

= ^ V 2 ( v mn J t v o)\( m \ a \n)\ i 

m 

= 4tt Vo{ | (n— 1 \a \n) | 12 +1 (n+ 11 a\n)\ 2 } 

== {n~\-\)hvQ. 


68. Canonical transformations 

Suppose a dynamical system is completely specified by a set of 
coordinates and momenta pj which may be represented by Hermitian 
matrices. If now the coordinates are transformed to Q i and the 
momenta to Pj in such a way that the matrices of the new coordinates 
and momenta are related to those of the old by the matrix equations 

Qj = 8qj T, Pj — Spj T , 
then [Q } , PJ = S(q } TSp k ~p k T8q s )T. 

Now the old coordinates and momenta satisfy the commutation 
relations (32); it follows from this last equation that the new coordinates 
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will satisfy relations of this type if the matrices S, T satisfy the 
equations ST = TS = 1, 

i.e. if T is the inverse, S~ x , of S. Thus the transformation 

Qi = SijS-\ Pj ~ Spj S~ x (48) 

preserves the form of the commutation relations (32) and is called a 
canonical transformation; it plays a role in the quantum theory ana¬ 
logous to that of a contact transformation in classical mechanics. 

The matrix S must also be of such a nature that the coordinate Qj 
is real. Writing out equation (48) we have 

(™\Qj\n) = g (m|5|fc )( i|^|Z )( Z|^W, 

The two are equal if and only if 

(l\S~'\n) = (n\S\l)*, 

i.e. if S- 1 = S' 

or SS f == 1 . (49) 

For the coordinates Qj to be real the matrix S of the transformation 
(48) has therefore to be a unitary matrix. 

Furthermore, since 

Pj « P.P. = Spj S-'Spj S = Spf S~ l , 

Pj = Pj Pj - SpjS- 'SpjS- 1 = SpjS~\ etc. 
it follows by induction that if F(p,q) is a dynamical variable which is 
a function of p jf which can be expanded in powers of p jy q jy then 
in the new system of coordinates 

F(P,Q) = SF(P,q)S-\ 

and in particular the Hamiltonian transforms according to the rule 
H(P,Q) = SH(p,q)S~ 1 . 


69. Perturbation theory in matrix mechanics 

A perturbation theory can be developed in matrix mechanics just 
as in wave mechanics, and it is often found in applications to particular 
problems that the matrix method gives the required result more quickly 
or in a more elegant form. Suppose that the Hamiltonian function 
of the dynamical system depends on the value of a parameter * whose 
deviation from zero denotes the amount of the perturbation, and that 
this function can be expanded in ascending powers of e in the form 

H(p, q) - H 0 {p, q)+ 2 * nH n(P> ?)• ( 60 ) 

71 =1 
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The problem to be solved in matrix mechanics is the determination of 
matrices p, q which satisfy the commutation relations (32) and which 


are such that 


H{p,q)= W, 


(51) 


where If is a diagonal matrix. We suppose that the motion of the 
unperturbed system (e == 0) has been solved; i.e. we have determined 
matrices p Q) q 0 satisfying the commutation relations and such that 

Ho(Po,qo) = W 0 , (52) 


where W 0 is a (known) diagonal matrix. 

We shall confine ourselves to a discussion of the case in which 
the dynamical system is uniquely specified by a single coordinate q, 
though the analysis may readily be extended to systems with more 
degrees of freedom. We assume that the matrix W can be expanded 


in the form 


W 


W 0 + I e*W n 

n=1 


(53) 


reducing to W 0 when e = 0. 

Transform the variables p 0 , q 0 to p , q by means of the equations 
p = Sp 0 S" 1 , q=Sq 9 S-\ 


where 8 is a unitary matrix. Then, since p 0 , q Q satisfy the commutation 
relations (32), so also do p , q. Furthermore, 

H(p,q) = SH(p 0 ,q 0 )8~ 1 , (54) 


so that by equation (51), 

SH(p 0 ,q 0 )= WS. (55) 

Writing 8 — S 0 + 2 (56) 


and substituting from (50), (53), and (56) into equation (55) and 
equating powers of €, we have 

«-Vo = 0\ 

(S 1 W 0 -W 0 S 1 )-(W 1 S 0 -S 0 H 1 ) = o\ (57) 
(S 0 H 2 -W 2 SJ+^-W, SJ+iS, W 0 -W 0 S 2 ) = 0 ’ 


remembering that H 0 = W 0 . 

From the first of these equations we have 

|){(m|£ 0 |&)(A:|Wi|n)—(wi|tf£|fc)(&|S 0 |ra)} = 0, 

which reduces to ( wl l'8'ol w )(Wo»—^Jm) = 0, 

since W 0 is a diagonal matrix. If the unperturbed system is non¬ 
degenerate, -jf~ W 0n unless m = n; thus (m|5 0 |w) is zero if m ^ n, 
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so that S 0 is a diagonal matrix. To determine the diagonal elements 
we must now make use of the fact that S is unitary, i.e. 

whence 


1, S 0 Sl+S,SI - 0, S 0 Sl+S x S[+S 2 SI = 0. 

(58) 

From the first of the equations (58) we see that (w|$ 0 |ft) is of the form 
e i,x n which can be absorbed into the usual (unwritten) time factor. We 
thus may write 




(59) 


Substituting this value for (m\S 0 \n) in the second of equations (57) we 

have = (»|Wi|»)-(»|flil»). (60) 

from which it follows that H l is not a diagonal matrix and also that 

(n\W x \n) - (n\H x \n). (61) 


Also from equation (60) we have for m n 

[m\S 1 \n)= ( p^L. (62) 

"0m "on 

To find the diagonal elements (nl/SJn) we make use of the second 
of equations (58); since S 0 is a unit matrix this reduces to 

S x +S{ = 0, 

giving (n|^|n)+(»|Sf|n) = 0, 

i.e. (n\$i\ n ) — 6. 

From the third of equations (57) we have on substituting for the 
matrix elements of S Q , H 0 

(m|W£|w) = (m\H 2 \n)+ ^ (m|^|i;)(A:|fli|n)+(m| W 1 \m)(m\S 1 \n)+ 

+ {m\S 2 \u){W 0m ~W Qn ). 

Putting m = n and substituting from equation (62) for (mlS^k) we 
obtain an expression for {n\W 2 \ri). Writing 

K = Wor.+cinlW^+e^n^l^+O^) 

we have finally 

w n = ir te +.(*|J^|»)+«*{(»|^|n)+ 2 

(63) 


which is of the same form as equation (25) of Chapter III. 
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70. Dirac’s generalization of matrix mechanics 

It is well known that eigenvalues of a certain operator may contain 
a continuous sequence of values as well as the discrete ones; for 
example, in the case of the energy this accounts for the existence of 
the continuous spectrum. To account for phenomena of this type in 
matrix mechanics we must extend our matrix calculus to include 
systems in which the elements of the matrices will have indices which 
are continuous in one region and assume only certain discrete values 
in another. The methods developed by Diracf to deal with this problem 
will now be illustrated in a simple case—that in which the dynamical 
system possesses one degree of freedom; the extension to systems with 
more degrees of freedom is purely formal. 

The first point to be observed is that in any region in which the 
indices vary in a continuous fashion the elements of a matrix (£il/1£ 2 ) 
are each functions of the two variables and £ 2 . In this region the 
matrix law of multiplication becomes 

(filM.) = / (fil/IWtf.tftlfi'lf.). (64) 

the integration being taken over all the values of f 3 in the continuous 
range. In the case of mixed ranges the product will be the sum of an 
integral and a sum over discrete values. In the continuous case the 
place of the unit matrix is taken by the Dirac 8 -function which is 
defined by the following relations 

00 

8(;r) = 0 (x 0), J 8 (x)dx — 1 . (65) 

— 00 

From this definition the following properties may be derived directly 
or as a result of an integration by parts 


00 


— oo 

(66) 

— 00 

(67) 

00 

J 8(ct-mp-$)d£ = 8(«-j8). 

(68) 


— 00 


From equations (64) and (66) it follows at once that 

t P. A. M. Dirac, Proc. Roy . Soc. A, 113, 621 (1927). 
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which shows that in the continuous range the elements of the unit 
matrix are &I1&) = S&-&). 

Similarly the elements of the diagonal matrix 

£ 0 0 . 

0 f 0 . 

0 0f'. 

are (£\ «|f) = f 8(f-f ). 

If we write (f|j8|f) = Js'tf-f), 

l 

then it follows immediately from the definition (04) that 

= jf {^S(|-n8'(f-f)-f8'(^f)8(f-f)}df. 

Integrating the second term by parts and the first by the rule (68) 
we have 

(fitf-jfai n = -4 J stf-mr-fw = «s(*-n, 

which may be written in the form 

cxj 8 —— ih 

to show that oc and ft are canonical conjugates. 

The fundamental problem of the generalized matrix mechanics is the 
discovery of a transformation of the coordinate system which leads to 
a diagonal matrix for the Hamiltonian of the system. Suppose, for 
instance, that we have a representation in which the generalized 
coordinates and momenta are of the form 

(fi?in = m-n mn = jW-n 

Suppose we change the coordinates and momenta to (rj\Q\y]') and 
(rj\P\rj f ) by a transformation 

q = 8Q8-\ p = SP8 - 1 

to make the Hamiltonian function H a diagonal matrix, say W v 8(*7~- •>?')• 
Then, by (54), _ SH(p> 

j m(p,g)in<t£'(mv) = j mvWwiHiv) 



that is, 
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The integral on the right-hand side can be evaluated immediately to 

glve { m\ m'tf'i-sfo) - (69) 

Equation (69) is Dirac’s generalized form of the Schrodinger equa¬ 
tion.! ft assumes a simpler form when the Hamiltonian H(q,p) is an 
algebraic function of the variables p, q\ for from equations for p, q and 
the results (66) and (67) 


from which it follows by induction that, if f(q,p) is an algebraic function 

P> r I b d\ 

I mfmmv) =/((,; £)mv)- m 


Applying this result to equation (69), writing y = n and (£|$|t 7 ) = 8 n (£) 9 
we ha ve 

(71) 




which is the usual Schrodinger equation. The elements of the trans¬ 
formation matrix S, which leads to a representation in which the 
Hamiltonian H is a diagonal matrix, are the eigenfunctions S n ((j). 

The elements of the matrix representing any other dynamical variable 
F are given by the formula 


ivWW) = J { (f \F(q,pW)d£"(nSW) 

= J(i§is-»if)drj , (f *^)(fi%') 

by equation (70), provided, of course, that F is an algebraic function 
of the p and q. Now from the condition (49) we have 

) = (f \S\v)*, 

so that writing rj — n, (£\S\r]) = <$„(£) we have 

(n\F\n') = j S*(frF^^SA0d$. 


t P. A. M. Dirac, Quantum Mechanics , 2nd ed., p. 116. 
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PROPERTIES OP ORTHOGONAL FUNCTIONS 

1. Introduction 

A set of complex functions T',(x) of the real variable x is said to 
be orthogonal in the interval a ^ x ^ /J, if all the functions of the set 
are such that the integral 

s 

J 

a 

exists for all values of n, and if 

j? 

J Y* (x)Y n {x)dx = 0 (m^n). 

(X 

The quantities ql and jS may be zero, finite, or ± 00 . The simplest 
functions of this type defined in the interval — tt ^ x ^ tt are cos nx 9 
sinw#, where n is an integer. The most important property of ortho¬ 
gonal functions is that under certain conditions we may expand any 
arbitrary function f(x) as a series of the type 

fix) = 2« n ^(*) 

n 

analogous to Fourier scries. Using the orthogonal property it follows 
immediately that the coefficients a n are given by the formula 

/ f{xY¥l{x)dx 
°* = 1 

J Y n (x)Y:(x)dx 

a 

If, for all values of n , p 

j |Y„(z)| *dx = 1, 

a 

the functions %(#) are said to be normalized ; in this case the formula 
for the coefficients a n becomes 

a n = jf(x)Y*(x)dx. 

Ot 

Orthogonal polynomials are of great importance in quantum me¬ 
chanics. The theory of such polynomials has been treated in great detail 
by Szeg6.f No such rigorous treatment will be attempted in this 

t G. Szego, Orthogonal Polynomials (American Math. Soc. Coll. Publ. XXIII, New 
York, 1939). 
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appendix; we shall confine ourselves to the derivation of such properties 
of orthogonal polynomials as are useful in the solution of certain 
elementary problems of wave mechanics. 


2. Hermite polynomials and functions 

In quantum mechanics the Hermite polynomials arise in the solution 
of the Schrodinger equation for the linear harmonic oscillator. It was 
shown in § 11 that, in this instance, the wave equation could be reduced 
to the form 


d 2 Y 
dy 2 


+ (A-y 2 )Y = 0. 


( 1 ) 


We have to determine the values of the parameter A which lead to 
solutions which tend to zero exponentially as y tends to infinity. The 
solution was obtained by making the substitution 

Y = ( 2 ) 

and it was shown that if A = 2 n+1, (3) 

then H(y) is a polynomial of the form 

H n (y) = 2,A g y* (A n 4 0 ). (4) 


8 = 0 

Substituting from equation (4) into the equation 
d 2 H dH 0 jj A 


dy 2 dy ' “ ^ 

obtained from jj. 51 by putting A = 2 w-fl, we obtain the recurrence 

_ 4s—l) 

^8 -2 


relation 


2 (*-»- 2 ) * 

for the coefficients of y. Taking A n to be 2 n , we obtain the solution 


H n (y) = 2* r- 


+ «(»-l)(n- 2 )(n- 8) .^_ 4 




4 ° ' .4.8 " " r (6) 

The polynomial defined by equation ( 6 ) is called a Hermite polynomial , 
and the corresponding function 

X(y) = e-» y'H n (y) 
is called a Hermite function. 

From the form ( 6 ) of the Hermite polynomial it is readily verified 
directly that , n 


H n (y ) = (-l)»c»*: 




( 7 ) 


Differentiating both sides of this equation with respect to y we obtain 
the recurrence relation 

K(y) = ^yH n {y)-H n+1 (y). 


( 8 ) 
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A second differentiation yields 

H'n(y) = 2yH' n (y)+2H n (y)-H' n+1 (y). (9) 

Now H n (y) is a solution of equation (5), so that 

H"niy) — 2yK(y)-2nH n (y). (10) 

An expression for H' n+1 (y) in terms of H n + 2 (y) and H n+1 (y) can be 
obtained from equation ( 8 ) by replacing n by n+ 1 ; it is 

K+i(y) = 2 yH n+1 (y)-H n+2 (y). (11) 

Eliminating H f ^(y), H' n (y), and H' n+1 (y) from equations ( 8 ) to ( 11 ), we 
obtain the recurrence formula 

2 V H n +1 (y) (y)+2(n+l)H„(y). ( 12 ) 

The first two Hermite polynomials can be determined readily from 
equation (7); the remaining ones can then be found by repeated use 
of equation ( 12 ). In this way we find for the first eleven polynomials: 

H 0 (y) = i, 

H i(y) = 2 y, 

H z (y) = 4y 2 -2, 

HM = %*-12y, 

Hi(y) = 16y 4 —48j/ 2 4-12, 

H & (y) = 32jy 5 —160j/ 3 +120t/, 

H«{y) = 64?/«--480,y 4 +720j/ 2 -120, 

// 7 (y) = 12% 7 — 1344y 5 -f 336(fy 3 -]68(ty, 

//„(«/) == 256y 8 -3584«/«-f 13440y 4 -13440«/ 2 +1680, 
if 9 (y) = 612y®-9216y 7 +48384y 5 — 80640y 3 +30240t/, 

H 10 (y ) = 1024y 1( '-23040t/ 8 +161280y«-403200j/ 4 +302400y 2 -30240. 

3. Integrals Involving Hermite functions 

To determine the normalization factors of the eigenfunction of the 
linear harmonic oscillator we require to know the value of the integral 

00 00 

I(m,n) = j y ¥ n (y)Y m {y)dy = f e~v‘HJy)H n (y)dy. (13) 
— 00 —00 

The proof that I(m,ri) vanishes unless m — n follows at once from the 
fact that the function T„(y) satisfies the differential equation 

dy 2 


y 2 T n +(2n+l)Y n = 0, 
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SO that 


oo oo 

j'r m (y)'r n (y)dy = — l^J T n —T"T to ) dy, 


which is zero unless m — n. 

In particular, l(n,n+2 ) = 0, so that from equation (12) it follows 
that 

J 2ye~v'H n (y)H n+1 (y)dy = 2(n+l)I(n,n). (14) 

— 00 

Now the left-hand side of this equation may be written 

— oo —oo 

which gives the recurrence formula 

7(71+1,71+1) = 2(n~\~\)I{n i n). 

00 

Now 1(0,0) = J e- yi dy = Vtt, 

— 00 

so that by induction we arrive at the formula 

I(n,n) — 2 n n\ VW. (15) 

Thus the normalized eigenfunctions of the Schrodinger equation 
d 2x F 2m T „ j 2 UJ . 

*?+ w (V-iP>')'r = o 




where a = (2mp/li 2 ) i . J 

The matrix element (n\x\m) for the harmonic oscillator is, by defini¬ 
tion, given by the equation 

00 oo 

— OO —00 

Making use of the recurrence formula (12) we find that 
00 

J ye-v'H n (y)H m (y)dy — $I{n,m+l)+nI{m,n—l), 

— 00 

and henoe by use of the orthogonality relation that 

(■ n\x\m ) = 0 
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unless m = n±l. Putting m — w+l and inserting the value of 
J(n+1> w+1) as given by equation (15), we have finally 


( »'*i (n > 


4. Laguerre polynomials 

Closely related to the Hermite polynomials are the Laguerre poly¬ 
nomials which may be defined by the equation 

m) = $=[ e ‘; ft ({ ' e_f) ]- (18) 

A comparison of equations (7) and (18) shows that the Hermite poly¬ 
nomials can be reduced to Laguerre polynomials with parameters 
n = Conversely, Laguerre polynomials can, to a certain extent, 
be reduced to Hermite polynomials, by the relationf 

I <<» > -»>■ 

(19) 

The Laguerre polynomials are most easily calculated directly from the 
definition (18); the first ten polynomials are found to be: 

mo = i, 

m) = i~z, m)==- 1 , 

Ll($) = 2-4f+f 2 , 24(f) = -4+2f, 24(f) = 2, 

mo = «-18f+9f*-f», 24(f) = -18+18f-3f\ 

mo = 18-6f, 24 (f) = -6. 

It also follows directly from equation (18) that L£(£) is a solution of 
the ordinary differential equation 

^ + (n+l-!)fg + (k-n)y = 0. ( 20 ) 

Now if we write x = , g = £ /+1 6r, 


we find that the equation 


S-sS+ 

da: 2 dx ~ 




= 0 


( 21 ) 


occurring in the theory of the hydrogen atom (equation (18) of Chapter 
II) is transformed to 


f H + [2(l+l)-Z]~ + (l*-l-l)G = 0, 


(22) 


t A. W. Uspensky, Ann. Math. (2), 28, 593 (1927). 
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which is identical with equation (20) if we choose 
n = 2Z-J-1, k = 2 1+8, 

where s is a positive integer and a = 2(Z+s). Thus the solution of 
equation (21) may be written in the form 

9 = (Zxy+'Lf^x). (23) 


The integral l(£)I$(£)d£ 

0 

is of importance in wave mechanics; its value is given by the following 
formula due to Schrodinger 

Kn~.k,u = (—l) fc+fc X! k'l ( p—n )\X 


cr 

2 fi\ (k~ 


{k-n—s)l(p- 


_ (?+*)! 

-k+s)\(le'- 


—ev)! ' 


(24) 


where a is the smaller of the two integers k—n, k'—n\ 

In determining the normalization factors for the eigenfunctions of 
the hydrogen atom we require the value of the integral 


0 

In this instance h—n = lV—n’ = s— 1 and all the terms in the sum on 
the right-hand side of equation (24) vanish except those for which s 
takes on the values ^— 1, $—2. Equation (24) then yields for the value 
of this integral 

J ^e-t{l4\'M)yd£ = M. (25) 

0 


5. The Legendre polynomials 

An equation of great importance in mathematical physics is the 
Legendre equation 

(1_F2) 0~ 2 ^ + ^ +1)y = 0 ’ (26) 

in which l is a constant (but not necessarily an integer). If we wish to 
obtain polynomial solutions of this equation of the form 

y=1 a ,H‘ e («* # °)> (27) 

a-0 

then substituting from equation (27) into equation (20) and equating 
to zero the coefficient of (i n we obtain the relation 

a n [n(»+l)—1(1+1)} = 0. (28) 
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Since by hypothesis a n ^ 0, it follows that polynomial solutions of 
equation (26) of the type (27) are only possible if l is a positive integer. 
Substituting from (27) into (26) and equating coefficients of p 8 we obtain 
the recurrence formula 


n — ( s^H^+^+l) 
s+2 >+l)(H-2) * 


(29) 


for the coefficients a 8 , whence we derive the polynomial solution 


= 2 


s -0 


(~l)*(2w—2s)! 
2 ,, 6*! (ft—6*)! (ft—2s)r 


( 30 ) 


where the integer a is \n or \{n— 1) according as n is an even or an 
odd integer. P n (^) is called the Legendre polynomial of degree ft. 
From this definition of the Legendre polynomial we deduce the 
Rodrigues formula 


PnW = 2 


( — 1 )* 

2"s\(n —«)! dfi n 


{n 2n ~ 2s ) 


1 d - y 

2 n ft! dfi n (ft—sj! 


1 d" 


2”ft! d/x n 




(31) 


6. Integral expression for the Legendre polynomial 

If we assume Cauchy’s theorem in the form 



c 


fit) 


«--/(/*). 


where/(f) is an analytical function of the complex variable in a certain 
domain R which includes the point f = /x, and where the integral is 
taken along a closed contour G which includes f = /x and lies wholly 
within the domain R, then by differentiating both sides of the equation 
l times with respect to /x we obtain the result 



m 

U-rf +l 


dl 


Substituting the function (£ 2 —1)* for /(£) we obtain, by an application 
of Rodrigues’ formula, 



(C 2 —l ) 1 


dl 


(32) 
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Equation (32) is known as Schlaefli’s formula for the Legendre poly¬ 
nomial. Using the identity 

2 -<-i(£ 2 _iy+i tt [ ({ 2 -i y 2 —'({ 2 _i y 2- 1 - 1 d it 2 - iy +1 

(C—ja ) ,+ 2 M ({-^ +1 “ ({-/*) <+r l+ldt\l-pj 

and noting that if the second term on the right-hand side of this identity 
is integrated round a contour of the type specified at the beginning of 
this section it gives a zero contribution, we arrive at the relation 

Differentiating both sides of this equation with respect to fx we obtain 
the recurrence formula 

-Pf+x(M)—== (1+1X150^). (33) 

In a similar fashion we can establish that 

(J +1 )Pi+i(p) —(2/ +1 )pPi(fi) + IPi-iip) « 0. (34) 

Differentiating this last equation with respect to fx and substituting for 
P'i +1 (ix) from equation (33), we obtain the relation 

txPKri-P'^fx) = Ufa). (35) 

Elimination of fxP[{fx) from equations (33) and (35) yields the result 

PiM-P'i-iM - (a+i«(#i), (36) 

and finally the elimination of P\„ x from (35) by means of equation 
(33) with l—l written for l gives 

(fx 2 — l)P'i(fi) * I^-IP^). (37) 

Now the expressions for P 0 (^) an d P^jx) can be derived immediately 
from either of equations (30) and (31); they are found to be 1 and (x 
respectively. The expressions for P n (fx) (n > 2 ) then follow by repeated 
applications of the recurrence formula (34). In this way we obtain for 
the first six Legendre polynomials: 

Po(n) — 1 » PM — 3#t). 

PM — PM = i(35fi 4 —30/i 2 +3), 

PM = i(3/x 2 -l), PM = J(63p, 5 —70p 8 + 15/i). 

7. The associated Legendre polynomials 

The equation 

(S8) 

which reduces to the Legendre equation (26) when u — 0 , also plays 
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an important role in mathematical physics. By obtaining solutions in 
series in the same manner as in the last section it is easily verified that 
polynomial solutions of this equation exist if and only if u and l are 
integers. In the subsequent analysis we shall assume that this condition 
is fulfilled and also that u < 1. 

If we make the substitution 


V — (!— P 2 ) 1 " 1 ? 

in equation (38), we find that the function 7)(/x) satisfies the differential 
equation 

//2/irt /'/on 

0 , 




and this may be written in the form 
d u 

dfx u 


( 1 - 


m 2 )K-2 tS+ni+w 


o, r) 


<n 

d,^ -'"dp • • ' /s i ” ' 

by the use of Leibnitz’s theorem. Now the expression in the bracket 
on the left-hand side of the former of these two equations is simply 
the expression which is equated to zero in Legendre’s equation (26). 
Hence if £ is any solution of the Legendre equation (26), the function 

JW 


y = (!-/**)«• 


d/ju u 


is a solution of the associated Legendre equation (38). The polynomial 
form of rj corresponding to £ = JJ(/z) leads to the associated Legendre 
function which is thus defined by the relations 

PIM = (39) 

the second of these expressions being a consequence of the Rodrigues 
formula (31). 

As a result of equation (39) the recurrence formulae for the associated 
Legendre functions can be determined immediately from those for the 
Legendre functions. Differentiating equation (34) u times with respect 
to fi and equation (35) u— 1 times and eliminating d u ^ 1 P l (y f )/dfi u ~' 1 from 
the resulting equations we obtain the relation 

yPf(y) = Bj* JT- AWri- (*>) 


Similarly if we differentiate (36) u times with respect to p and then 
multiply by (1—p 2 )* (1+u) we obtain the recurrence formula 


(X-pWfc) = (41) 
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The associated Legendre functions can be deduced from the table 
of Legendre polynomials at the end of § 6 by means of .the former of 
the relations (39). In this way we obtain: 

= (i-M 2 )*. 

PKfi) = 3/4(l-/4*)>, PUh-) = 3(1-^), 

PUr) = U 15^-3)(l-^ ; P§(/x) = 15^(1-^), Pfo) = 15(l-/4*)», 
Pft/*) =* i(35^-15 / x)(l- / * 2 )i > P|(/x) = *(105/**-15)(l- / **) > 

PJ(m) = 105/x(l —/x 2 ) } , Pi(fi) = 105(1—/*■)», 
PJO*) = *(315/i«~210/i*+15)(l-/*«)♦, 

PJ(M) = K315/x 3 -105 / x)(1- M 2 ), 

= l(946/**-106)(l-/*»)», 

P 6 V) = 945/4(1 -/x 2 ) 2 , 

P£(/x) = 945(1-/x 2 )«. 


8. Integrals involving the associated Legendre functions 

i 

If we write /“$*’ = J P]*(/i)P|f (/x) ri/x, (42) 

-i 

then, by equation (39), 

/«,«_ ( *)'* f ( u n ] \it d l+U / 2_W ^ +U /m2_ l) l 'dn 

-l 

~1 

as a result of J'-fw integrations by parts. Expanding the second factor 
in the integrand by Leibnitz’s theorem we find 


w 


(—l)2u+4- 

IF? 


L/‘“ T * v (!'+«)! 

Iff! 


(y+ w _s)! 


-1 

Now the term of highest order in the expansion of (/x 2 —l) w is fx 2u , so 

(Jl'+U—8 

that g-y— ■ — (/i 2 — l) u vanishes if a > i'—Similarly the second factor 

in the integrand vanishes if a < Z—«. Since the integral is symmetrical 
there is no loss of generality involved in the assumption that l < V, 




$ 8 INTEGRALS INVOLVING ASSOCIATED LEGENDRE FUNCTIONS 385 


but if Z < V the above two relations limiting the values which s is 
capable of assuming are incompatible, so that 


= 0, Z # V. (44) 

If Z = l\ then s = l~u is the only term of the series (43) which gives 
a non-vanishing contribution to the final result. We then have 


ru.u_ (~1) 2U+ * f 

° 2 sa (Z!) 2 (l—u)\(2u)\ J 

(Z+m)! 2 

— (i—u)\ 2 z+r 


(n 2 -iy(2w)!(2Z)!cZ/x 


(45) 


The analysis for proceeds along similar lines; again we find 

that the integral vanishes unless l = l'. 

By means of the recurrence formula (40) we have 
1 

-1 


so that, by equation (44), 

1 

J o ( 46 ) 


unless Z' = Z+l. When Z' = Z±1 the value of the integral follows 
immediately from equation (45). 

Similarly the recurrence relation (41) yields the formula 

i 

j (i-^)»“Piv)Pj{±V)^ (47) 

-i 

so that this integral also vanishes unless 

V = Z±l. 


9. Some expansions involving Legendre functions 
From Schlaefli’s integral formula (32) for the Legendre polynomial 
P n (fi) and the definition (39) of the associated Legendre function, we 
see that when m and n are real integers 




1 (n+m)l 
2m 2 w n! 



(£ 2 -l) ro 

(£- j a ) w + m + 1 


dl 


(48) 


c 

where the contour C encloses both of the points £ = 1 and £ = ja. 
If /a > 0 we may take the contour C to be the circle 
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Integrating round this contour we obtain from (48) the equation 


n ! cos . Dm/ x 


(n+m)!sin 


2ir 

2^ J {m+V(m 2- 1)c°s(^— ifi)} n ™*m<f>d4 = 

0 

from which follows immediately the Fourier expansion 
{m+vV— 1 )coB(tf>—ifi )}* = P„(/*)+2 ^ 7 n + m y P%(n)cosm{i ft 


(49) 


(n+m)\ 

Changing n to — (tt+l) we obtain the expansion 

V» (»-”)! 


{m'+V(m' 2 -1)cos^}-»-i = P„(/x')+2 2 (-1)” 


ra^l 


n\ 


(50) 

P2V)coswnft, 

(51) 

where, as noted above, we have assumed n to be an integer. 

Applying Parseval’s theorem for Fourier series to the series (50) and 
(51) we find that the series 

P»P„(fO+2 j> (—!)” P^)P£V)cosm,£ 

converges to the sum 

_L f fc+vV—i) 00 *# +<ft) }” , 

2tt J {/+V(// 2 -l)cos$»+i ^ 


This integral may be evaluated by means of Cauchy’s theorem.f Its 
value is found to be ^[jLt/Lt'+^^—lJ^^—lJlcos^]. Writings = cos0, 
yJ == cos#', and 

cos© = cos#cos#'+sin#sin#'cos<£, 
we obtain the result 

P»(cos@) = P n (cos0)P„(co S r)+2 2 P^(cos0)P*(oos 6'). 


1 


(52) 

This result is often of value in the solution of problems in quantum 
mechanics* 

Another expansion which is of use in the relativistic theory of the 
Auger effect^ and of the interaction of two electrons in the self- 
consistent field method§ is the following: 

- f (2n+l)^^ JhSta)p n{ cosi^), (53) 

t E. W. Hobson, The Theory of Spherical and Ellipsoidal Harmonics, 366-71 (Cam* 
bridge, 1931). 

% H. S. W. Massey and E. H. S. Burhop, Proc. Hoy . Soc. A, 153, 661 (1936). 

§ B. Swirles, ibid. 157, 680 (1936). 
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where r a is the smaller and r b the greater of r x and r 2 and 
Vn(x) ~ \K±™)Jn + l(x) \ 

£.(*) = M”*){M n+i (x)+(-i) n J-n-M t 

A similar expansion for the real exponential function can also be found; 


(54) 


(55) 


n=0 


where r a and r b are defined as before. In equations (54) and (55) the 
functions I m (z ), and K m (z) denote the usual Bessel functions in 

the notation adopted by Watson, f 

Partial differentiation of equation (55) with respect to k leads to 
other expansions of importance: 


e -fcir,-r,! = V ^±i^ n (i,r 6 ; r a )P n ( cosr^), (56) 

n^O^ 1 2 ' 

|ri-r 2 |e-*i r <- r *' = V ?^±lq n (k,r b ; r a )P n ( cosi^* 2 ), (57) 

where 

Pjk, V> r a ) = r b 4 +i (^„)iC n+ ,(AT 6 )-r 0 4_ 1 (*r„)X'„ +i (A:r 6 ) (58) 

and 


<lJM,r b \ r a ) = (r£+rg)/ n+t (fcrJJ)T n+i (AT 6 )- 

- ^ l n ^{kr a )K n ^{kr b )- r a r 6 I n+l (kr a )K n+i (kr b ). (59) 

If formulae for higher powers are needed they may be obtained by 
further partial differentiations with respect to k . These expansions 
facilitate the evaluation of certain complicated integrals occurring in 
the theory of molecular structure. J 


10. Surface spherical harmonics 

From the two sets of orthogonal functions Pf(oos 0) and e iu< ^ we can 
form a third set of functions 

yaw = («>) >. 

which is an orthogonal set on the surface of the unit sphere, i.e. the 
functions of the set satisfy the integral relation 

/ sin0 d$ J Yf u Y t . u . <fy = 8 ff 8 ttU .. 
o o 

f G. N. Watson, The Theory of Beeeel Function* (2nd ed., Cambridge, 1944). 
t C. A. Coulson, Pt oc. Comb. Phil. Soe. 33, 104 (1937). 

oc2 


( 61 ) 
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The functions Y lu {0,$) possess the further property that they are 
solutions of Laplace’s equation in polar coordinates when djdr is taken 
to vanish identically. Furthermore, they satisfy a large number of 
recurrence relations which may readily be written down from a know¬ 
ledge of the recurrence relations satisfied by the associated Legendre 
functions Pf (cos 0), Some of these results which arise frequently in 
calculations in quantum mechanics are given below for reference. 
Their derivation follows directly from the results of the previous 
sections. 


(L x +iL u )Y liU (9,<t>) = h{(l-u)(l+u+l)}iY ltU+1 (e,<l>), (63) 

(L x -iL u )Y Uu (0, <f>) = h{(l+u)(l-u+ ltyY^O, 4), (64) 

Lt Y lM (0, <f>) — mhY lu , (65) 

L% u (6,<t>) = l(l+l)h% u (8,<l>). ( 66 ) 
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Absorption coefficient, 253. 

— of radiation, 253, 255, 266. 

Airy integrals, 18. 

Alkali atoms, 100. 

— halides, 164, 201. 

Alpha decay, 17, 22. 

— particles scattered by helium, 113. 
Analytical wave functions, 142. 

Angular momentum, 44, 85, 86, 305, 306, 

359-62. 

Annihilation of positrons, 275, 313. 
Anomalous Zeeman effect, 91, 100. 
Associated Legendre polynomials, 382. 
Atomic size, 41. 

— units, 127. 

Auger effect, 342. 

Benzene molecule, 198, 200. 

Bohr magneton, 88. 

— theory of atom, 44. 

Bond angles, 194. 

Bonding orbits, 199. 

Bom approximation, 238, 239. 
Bose-Einstein particles, 354. 

Bragg reflection, 212. 

Breadth of spectral lines, 268. 
Bremsstrahlung, 275. 

Brillouin zone, 212. 

Butadiene, 200. 

Canonical transformations, 368. 

Carbon atom, 150. 

— molecules, 194. 

Centre of force scattering, 234. 

Chlorine ion, 140. 

Classical Poisson bracket, 356. 

— theory of radiation, 275. 

Classification of radiation processes, 274. 
Coherent scattering of light, 274. 

Cohesive forces, 201, 220. 

Cold emission, 219. 

Collective electron model, 215. 

Collisions, 233 et seq.; between electrons 
and atoms, 242; between fast particles, 
339; between identical particles, 113; 
between molecules, 162. 

Colour centres, 226. 

Commutation relations, 36, 363. 

Complex molecules, 192 et seq. 
Compressibility, 202. 

Compton effect, 4, 33, 190, 274, 316. 
Conduction band, 223. 

Conductivity, 222. 

Conductors, 214. 

Constant of motion, 363. 


Contact between metal and insulator, 229; 
between metal and semi-conductor, 230. 

— transformations, 357, 369. 

Continuity equation, 259, 294, 309. 
Continuous spectrum, 60. 

Cosmic ray phenomena, 289. 

Coulomb scattering, 114, 241. 
Cross-section for radiation processes, 267 

et seq.; for collisions, 233. 

Crystals, 201, 207, 226, 227. 

Darwin’s variation principle, 308, 353. 
Deflexion of electron beams, 10. 
Degenerate states, 49, 72. 

Delta function, 95, 277, 372. 

Deuteron, 122. 

Diamagnetic susceptibility, 141. 

Diatomic molecules, 103, 110, 162, 176. 
Differential analyser, 125. 

Diphenyl, 200. 

Dipole moments, 257. 

Dirac wave equation, 296, 322, 351; 
matrices, 298. 

Dirac’s generalization of matrix mechanics, 
372. 

Distribution of momentum, 19, 34, 59, 69, 
190. 

Double scattering of electrons, 332. 
Doublet separation, 100. 

Dual nature of electrons, 7; of light, 4. 

Einstein-Bose statistics, 107. 

Einstein coefficients, 253. 

Electrical conductivity, 222. 

Electron affinities, 204; beams, 10; in a 
box, 46; spin, 89. 

Electronic motion in crystals, 207; in non- 
metals, 223. 

Emission bands, 215. 

— of radiation, 253, 258, 265. 

Energy levels, 42; by perturbation method, 
118; in a magnetic field, 87; of hydrogen 
atom, 44, 57, 323; of lithium atom, 57. 
Energy of a many-electron system, 133. 
Ethylene double-streamers, 197. 

Exchange integral, 119, 134. 

Excitation of atoms by a-particles, 248. 
Excited states, 41. 

Exciton, 227. 

Expansion theorem, 65. 

F-ventres, 226. 

/-sum rule, 169. 

' Fermi-Dirac statistics, 107. 

Field quantization, 259. * 
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Fine structure formula, 326. 

First-order radiation processes, 274. 
Fock’s equations, 122, 135, 214. 

Fourier transforms, 68. 

Free particle in Dirac theory, 310. 
Frenkel defect, 207. 

Gamma-ray microscope, 33. 

Gauge invariance, 291. 

Generalized exchange integral, 134. 
Graphite, 214. 

Green’s theorem, 28. 

Group velocity, 24. 

Hall effect, 224. 

Hamiltonian equations, 37. 

— of radiation field, 259. 

Harmonic oscillator, 50, 53, 261, 367. 
Heitler-London method, 176, 178, 188. 
Helium atom, 112, 118, 131, 143. 

Hermite functions, 62, 328, 376. 
Hermitian matrices, 37, 300, 364, 
Hexatriene, 200. 

Higher spin, 353. 

‘Hole’ theory, 291, 312. 

Homonuclear molecules, 110. 

Homopolar forces, 164, 204. 
Hume-Rothery rule, 222. 

Hybridization of wave functions, 194. 
Hydrogen atom, 44, 54, 102, 323; mole¬ 
cular ion, 184, 190; molecule, 176, 178. 

Impure crystals, 226. 

Impurity centres, 225. 

Incident wave, 235. 

Inelastic scattering, 245. 

Insulators, 214. 

Intensity of spectral lines, 253. 
Interatomic forces, 162 et seq. 
Interference, 1. 

Internal conversion, 345. 

Interstitial atoms or ions, 204, 205. 
Invariant form of Schrodinger’s equation, 

6l. 

Ionic crystals, 201. 

— radius, 140, 203. 

Ionization energy, 42, 266. 

Isotopic spin, 123. 

Klein-Gordon equation, 295, 297, 304, 354. 
Klein-Nishina formula, 287, 316. 
Kronecker delta, 146. 

Laguerre polynomials, 379. 

Land© ^-factor, 92, 113. 

Lattice energy, 162; parameter, 202; 
vibrations, 204. 

Legendre polynomials, 55, 380. 


Lennard-Jones perturbation formulae, 78. 
Lithium atom, 57. 

Lorentz invariance, 300. 

Madelung constant, 202. 

Magnetic field, 39, 84, 87. 

Magneton, 88. 

Many-electron atoms, 127 et seq. 

Matrix elements, 72, 263, 374; equations 
of motion, 362; mechanics, 356 et seq. 
Matter waves, 6, 24. 

Maximum overlapping, 192. 

Meson theory of nuclear forces, 124, 239. 
Mesons, 289. 

Metallic cohesion, 220. 

— forces, 164. f 

Methane molecule, 196. 

Molecular energy curves, 189. 

— orbitals, 175. 

Moller-Rosenfeld theory, 124. 

Momentum density functions, 34, 59, 69, 

190; operators, 36; space, 34, 69. 

Naphthalene, 200. 

Negative enorgy states, 291, 310. 
Neutron-proton scattering, 126. 
Non-stationary phenomena, 27. 

Normal Zeeman effect, 89. 

Octatetraene, 200. 

One-quantum annihilation, 315. 

Optical properties of impure crystals, 226; 
of pure crystals, 227. 

Orbital angular momentum, 85, 96, 305, 
359-62. 

Organic dyes, 200. 

Orthogonal functions, 375 et seq. 

— property, 64. 

Orthohelium, 122. 

Oscillator strength, 169, 257. 

Overlap forces, 175. 

Oxide layers, 232. 

Oxygen atom, 193. 

Paraholium, 122. 

Paramagnetic salts, 85. 

Paramagnetism, 85, 217. 

Paschen-Back effect, 101, 113, 117. 

Pauli matrices, 97; principle, 117, 130; 

wave equation, 98. 

Periodic field, 208. 

Perturbation independent of time, .252; 
method of obtaining energy levels, 118; 
periodio in time, 250; theory, 71-83, 
145-55, 369. 

Photoconductivity, 3, 224, 226. 
Photoelectric effect, 3, 266. 

Poisson brackets, 356-9. 
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Polar molecules, 80. 

Polarizability, 165, 166, 174. 

Polarization of electrons, 332. 

Positrons, 291, 313. 

Potential barriers, 15; jumps, 13. 

Quadripole moment, 258. 

Quantization with several degrees of free¬ 
dom, 48; of angular momentum, 44, 49; 
of radiation field, 259. 

Quantum Poisson bracket, 358. 

Radiation damping, 275 et seq.; processes 
classified, 274; processes involving 
mesons, 289; theory, 247 et seq. 
Raman effect, 4, 275. 

Rayleigh-Sclirodinger perturbation theory, 
73-7. 

Rearrangement collisions, 244. 
Rectangular barriers, 15. 

Reduced mass, 103, 113. 

Relativistic invariance, 300; quantum 
mechanics, 290 ot seq.; wave equations, 
290 et seq. 

Resonance energy, 200; fluorescence, 275. 
Rigid rotator, 45, 60. 

Rodrigues formula, 381. 

Rotating polar molecules, 80. 

Rotation of a rigid body, 366; of nucleus, 

44. 

Rotational states of diatomic molecules, 

45, 103. 

Rydberg’s constant, 43. 

Scattered wave, 235. 

Scattering by centre of force, 234; by 
Coulomb field, 241; cross-section, 233; 
of a-particles by helium, 113; of electrons 
by a nucleus, 329; of light, 274, 279, 
285; of mesons by nuclear particles, 289; 
of neutrons by protons, 126; of slow 
neutrons by a nucleus, 246. 

Schottky defect, 207. 

— emission, 219. 

Schrodinger equation, 12, 28, 38, 61, 374. 
Screening constant, 186. 

Second-order radiation processes, 274. 
Selection rules, 257. 

Self-consistent field, 119, 138, 214. 
Self-energy, 275. 

Semi-oonductors, 224. 

Simple harmonic oscillator, 50, 53, 261, 

367. 


Size of the electron, 277. 

Specific heats, 41, 46, 216. 

Spin coordinates, 108; greater than one- 
half, 353; matrices, 97, 298; of electron, 
89, 290, 303. 

Spin-orbit coupling, 321. 

Spinors, 351. 

Stationary states, 34 et seq. 

Statistical method of Thomas and Fermi, 
156. 

Statistics of particles, 107. 

Stem-Gerlach experiment, 84, 99, 333. 
Surface spherical harmonics, 387. 
Susceptibility, diamagnetic, 141. 
Symmetrical top molecules, 61. 

Symmetry characteristics of wave 
functions, 105. 

Tetrahedral orbitals, 195. 

Thermionic emission, 218. 

Third-order radiation processes, 274. 
Thomas-Fermi atom, 156. 

Transformation functions, 34, 66. 
Transition probability, 247. 

Transitions to unquantized states, 249. 
Transmission coefficient, 16. 

Transparency of a barrier, 16, 22. 
Transverse self-energy, 275. 

Trapping of electrons, 227, 230. 
Triangular barrier, 18. 

Trigonal orbitals, 196. 

Tunnel effect, 15. 

Two-body problem, 102 et seq. 
Two-quanta annihilation, 275, 315. 

Uncertainty principle, 30. 

Unitary matrix, 368. 

Vacant lattice points, 205, 206. 

Valence forces, 175. 

Van der Waals’ forces, 164 et seq., 204. 
Vibration of a molecule, 162. 

Vibrational states of diatomic molecules, 
46, 103. 

Water molecule, 193. 

Wave functions, 105; for many-electron 
atoms, 127; for the spin, 92. 

Wave velocity, 24. 

WKB method, 19. 

Width of spectral lines, 268. 

Wigner’s perturbation method, 81. 

Zeeman effect, 85, 89, 91. 
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